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2 ( 57 ) Abstract: The invention relates generally to genes that encode proteins that inhibit axonaJ growth. The invention relates specif- 
ically to genes encodign NgR protein homologs in humans and mice. The invention also includes compositions and methods for 
modulating the expression and activity of Nogo and the NgR proteins. Specifically, the invention includes peptides, proteins and 
antibodies that block Nogo-mediated inhibition of axonal extension. The compositions and methods of the invention are useful in 

^ the treatment of cranial o cerebral trauma, sprial cord injury, stroke or a demyelinating disease. 
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NOGO RECEPTOR HOMOLOGS 



FIELD OF THE INVENTION 

5 The invention relates to neurology and molecular biology. More particularly, 

the invention relates to CNS neurons and axonal growth 

BACKGROUND 

Among the mechanisms through which the cells of an organism communicate 

10 with each other and obtain information and stimuli from their environment is through 
cell membrane receptor molecules expressed on the cell surface. Many such receptors 
have been identified, characterized, and sometimes classified into major receptor 
superfamilies based on structural motifs and signal transduction features. The receptors 
are a first essential link for translating an extracellular signal into a cellular 

15 physiological response. 

Receptors on neurons are particularly important in the development of the 
nervous system during embryogenesis. The neurons form connections with target cells 
during development through axonal extension of the neurons toward the target cells in 
a receptor-mediated process. Axons and dendrites have a specialized region of their 

20 distal tips known as the growth cone. Growth cones enable the neuron to sense the 

local environment through a receptor-mediated process and direct the movement of the 
axon or dendrite of the neuron toward the neuron's target cell. This process is known 
as elongation. Growth cones can be sensitive to several guidance cues, for example, 
surface adhesiveness, growth factors, neurotransmitters and electric fields. The 

25 guidance of growth at the cone depends on various classes of adhesion molecules, 
intercellular signals, as well as factors that stimulate and inhibit growth cones. 

Interestingly, damaged neurons do not elongate in the central nervous system 
(CNS) following injury due to trauma or disease, whereas axons in the peripheral 
nervous system (PNS) regenerate readily. The fact that damaged CNS neurons fail to 

30 elongate is not due to an intrinsic property of CNS axons, but rather due to the CNS 
environment that is not permissive for axonal elongation. Classical grafting 
experiments by Aguayo and colleagues (e.g., Richardson etctl, (1980) Nature 284, 
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264-265) demonstrated that CNS axons can in fact elongate over substantial distances 
within peripheral nerve grafts, and that CNS myelin inhibits CNS axon elongation. 
Therefore, given the appropriate environment, CNS axons can regenerate, implying 
that CNS axonal injury can potentially be addressed by appropriate manipulation of the 
5 CNS environment. 

The absence of axon regeneration following injury can be attributed to the 
presence of axon growth inhibitors. These inhibitors are predominantly associated 
with myelin and constitute an important barrier to regeneration. Axon growth 
inhibitors are present in CNS-derived myelin and the plasma membrane of 

10 oligodendrocytes that synthesize myelin in the CNS (Schwab et aL, (1993) Armu. Rev. 
Neurosci. 16, 565-595)/ Myelin-associated inhibitors appear to be a primary 
contributor to the failure of CNS axon regeneration in vivo after an interruption of 
axonal continuity, whereas other non-myelin associated axon growth inhibitors in the 
CNS may play a lesser role. These inhibitors block axonal regeneration following 

15 neuronal injury due to trauma, stroke or viral infection. 

Numerous myelin-derived axon growth inhibitors have been characterized (see, 
for review, David et aL, (1999) W0995394547; Bandman et aL, (1999) U.S. Patent 
No. 5,858,708; Schwab, (1996) Neurochem. Res. 21, 755-761). Several components 
of CNS white matter, NI35, NI250 (Nogo) and Myelin-associated glycoprotein 

20 (MAG), which have inhibitory activity for axonal extension, have been described as 
well (Schwab et al, (1990) WO9005191; Schwab etaL, (1997) U.S. Patent No. 
5,684,133). In particular, Nogo is a 250 kDa myelin-associated axon growth inhibitor 
that was originally characterized based on the effects of the purified protein in vitro 
and monoclonal antibodies that neutralize the protein's activity (Schwab (1990) Exp. 

25 Neurol 109, 2-5). The Nogo cDNA was first identified through random analysis of 
brain cDNA and had no suggested function (Nagase et aL, (1998) DNA Res. 5, 
355-364). The identification of this Nogo cDNA as the cDNA encoding the 250 kDa 
myelin-associated axon growth inhibitor was discovered only recently (GrandPre et aL, 
(2000) Nature 403, 439-444; Chen et aL, (2000) Nature 403, 434-439; Prinjha at al, 

30 (2000) Nature 403, 383-384). 

Importantly, Nogo has been shown to be the primary component of CNS 
myelin responsible for inhibiting axonal elongation and regeneration. Nogo's selective 
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expression by oligodendrocytes and not by Schwann cells (the cells that myelinate P.S. 
axons) is consistent with the inhibitory effects of CNS myelin, in contrast to P.S. 
myelin (GrandPre et al, (2000) Nature 403, 434-439). In culture, Nogo inhibits 
axonal elongation and causes growth cone collapse (Spillmann et al, (1998) J. Biol 
5 Chem. 272, 19283-19293). Antibodies (e.g., IN-1) against Nogo have been shown to 
block most of the inhibitory action of CNS myelin on neurite growth in vitro 
(Spillmann etal, (1998) J. Biol Chem. 272:19283-19293). These experiments 
indicate that Nogo is the main component of CNS myelin responsible for inhibition of 
axonal elongation in culture. Furthermore, in vivo, the IN-1 antibody has been shown 

10 to enhance axonal regeneration after spinal cord injury, resulting in recovery of 

behaviors such as contact placing and stride length (Schnell and Schwab (1990) Nature 
343, 269-272; Bregman et al, (1995) Nature 378, 498-501). Thus, there is 
substantial evidence that Nogo is a disease-relevant molecular target. Agents that 
interfere with the binding of Nogo to its receptor would be expected to improve axonal 

1 5 regeneration in clinical states in which axons have been damaged, and improve patient 
outcome. 

Modulation of Nogo has been described as a means for treatment of 
regeneration for neurons damaged by trauma, infarction and degenerative disorders of 
the CNS (Schwab etal, (1994) W09417831; Tatagibae/a/., (1997) Neurosurgery 
20 40, 541-546) as well as malignant tumors in the CNS such as glioblastoma (Schwab et 
al, (1993) U.S. Patent No. 5,250,414); Schwab et al, (2000) U.S. Patent No. 
6,025,333). 

Antibodies which recognize Nogo have been suggested to be useful in the 
diagnosis and treatment of nerve damage resulting from trauma, infarction and 

25 degenerative disorders of the CNS (Schnell & Schwab, (1990) Nature 343, 269-272; 
Schwab et al, (1997) U.S. Patent No. 5,684,133). For CNS axons, there is a 
correlation between the presence of myelin and the inhibition of axon regeneration 
over long distances (Savio and Schwab (1990) Proa Natl Acad. Set 87, 4130-4133; 
Keirsteadefa/., (\992)Proc. Natl Acad Sci. 89, 11664-11668). After Nogo is 

30 blocked by antibodies, neurons can again extend across lesions caused by nerve 
damage (Schnell and Schwab (1990) Nature 343, 269-272). 
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SUMMARY OF THE INVENTION 

Genes encoding homologs (NgR2 and NgR3) of a Nogo receptor (NgRl) in 
mice and humans have been discovered. Various domains in the polypeptides encoded 
by the NgR2 and NgR3 genes have been identified and compared to domains in mouse 
5 and human NgRl polypeptides. This comparison has led to identification of a 

consensus sequence (NgR consensus sequence) that characterizes a family of proteins 
(NgR family). Based on these and other discoveries, the invention features molecules 
and methods for modulating axonal growth in CNS neurons. 

The invention provides a polypeptide that contains a polypeptide containing a 
10 tryptophan rich LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX U V 

Xj 2 C X 13 X 14 P X 15 X 16 X 17 X 18 X 19 X 2 q D L X 21 X22 L X 23 X^ X 2 5 D 

15 

X 26 X 27 X 28 C[SEQIDNO: 19] 



wherein X is any protein amino acid or a gap, and the polypeptide does not 
include amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
20 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). 



Preferably, X17 and X23 are (independently) arginine or lysine. In some 
embodiments, the amino acid sequence of the LRRCT domain is residues 261-3 10 of 
SEQ ID NO:2, or residues 261-3 10 of SEQ ID NO: 2 with up to 10 conservative 
25 amino acid substitutions. In some embodiments, the polypeptide contains the 
following NTLRRCT amino acid sequence: 

CPX 1 X 2 CX 3 CYX 4 X 5 PX 6 X 7 TX 8 SCX 9 X 10 X n X 12 X 13 X 14 X 15 X 16 P 
X 17 X 18 X 19 P X 20 X21 X 22 X^ R X 24 F L X 2S X^ N X 27 1 X^ X^ X 30 X 31 X 32 X 33 
X 34 F X 35 X 36 X 37 X 38 X 39 X 40 X« X 42 L W X 43 X^ S N X 45 X^ X^ X^ I X 49 
30 X S0 X sl X a FX D X j4 X 5S X 5( X 57 LEX 5l LDLX 59 DNX <0 X a LX o XoX w 
P ^66 T F X^ G L Xgg X 69 L X 70 X 71 L X^ L X7 3 X; 4 C X 75 L X 76 X 77 L Xyg 
X79 Xg 0 Xg, F X»2 G L Xg 3 L Q Y L Y L Q X 8J N X 86 Xg 7 Xg g Xg 9 L D 
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^112 



160 



wherein X is any amino acid residue or a gap and wherein the polypeptide is not the 

10 polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 (mouse NgRl). For 
example, X<j, X 37 and X 38 may represent a gap. Specific examples of polypeptides of 
the invention are SEQ ID NO: 2 (human NgR2), SEQ ID NO: 4 (mouse NgR3), and 
SEQ ID NO: 14 (human NgR3). In some embodiments, the polypeptide contains: (a) 
a NTLRRCT domain, and (b) less than a complete CTS domain, provided that a partial 

15 CTS domain, if present, consists of no more than the first 39 amino acids of the CTS 
domain. While the polypeptide may contain a functional GPI domain, a functional GPI 
domain may be absent, e.g., when a soluble polypeptide is desired. A polypeptide of 
the invention optionally includes an amino acid sequence of a heterologous 
polypeptide, e.g., an Fc portion of an antibody. 

20 The invention also provides a nucleic acid encoding an above-described 

polypeptide; a vector containing the nucleic acid, which nucleic acid may be operably 
linked to an expression control sequence; and a transformed host cell containing the 
vector. A method of producing a polypeptide of the invention is also provided. The 
method includes introducing a nucleic acid encoding the above-described polypeptide 

25 into a host cell, culturing the cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide. 

The invention also provides an antisense molecule whose nucleotide sequence 
is complementary to a nucleotide sequence encoding a polypeptide selected from the 
group consisting of: a polypeptide consisting of residues 311-395 of SEQ ID NO: 2, a 

30 polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a polypeptide 

consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid is from 8 to 
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100 nucleotides in length, e.g., about 20, 30, 40, 50, 60, 70, 80 or 90 nucleotides. The 
invention also provides a nucleic acid encoding such an antisense molecule. 

The invention also provides an antibody that binds to an above-described 
polypeptide. Polypeptides or antibodies of the invention can be formulated into 
5 pharmaceutical compositions containing the polypeptide or antibody and a 
pharmaceutically acceptable carrier. 

The invention also provides a method for decreasing inhibition of axonal 
growth of a CNS neuron. The method includes the step of contacting the neuron with 
an effective amount of a polypeptide or antibody of the invention. 

10 The invention also provides a method for treating a central nervous system disease, 
disorder or injury. The method includes administering to a mammal, e.g., a human, an 
effective amount of a polypeptide or antibody of the invention. Exemplary diseases, 
disorders and injuries that may be treated using molecules and methods of the 
invention include, but are not limited to, cerebral injury, spinal cord injury, stroke, 

15 demyelinating diseases, e.g., multiple sclerosis, monophasic demyelination, 
encephalomyelitis, multifocal leukoencephalopathy, panencephalitis, 
Marchiafava-Bignami disease, Spongy degeneration, Alexander's disease, Canavan's 
disease, metachromatic leukodystrophy and Krabbe's disease. 

The invention also provides a method for identifying a molecule that binds a 

20 ' polypeptide of the invention. The method includes the steps of: (a) providing a 
polypeptide of the invention; (b) contacting the polypeptide with the candidate 
molecule; and (c) detecting binding of the candidate molecule to the polypeptide. 

Unless otherwise defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which the 

25 invention belongs. In case of conflict, the present application, including definitions, 
will control. All publications, patent and other references mentioned herein are 
incorporated by reference. 

The materials, methods and examples presented below are illustrative only, and 
not intended to be limiting. Other features and advantages of the invention will be 

30 apparent from the detail description and from the claims. 

BRIEF DESCRIPTION OF THE FIGURES 
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Fig. 1 A-1B shows an alignment of NgR2 (SEQ ID NO:2) and NgR3 (SEQ ID 
NO:4) with the known NgR, NgRl (SEQ ID NO:5) and the Consensus Sequence 
(SEQIDNO:6). 

Fig. 2. mNgR3 does not bind hNogoA(1055-l 120). COS-7 cells were 
transfected with vectors encoding myc-NgRl or myc-NgR3, fixed, and stained with 
anti-myc antibodies or AP-hNogoA(1055-l 120). 

Fig.3. An alignment of the amino acid sequences of human NgRl, murine 
NgRl, murine NgR3, human NgR3 and human NgR2. Numbering begins with amino 
acid #1 of murine NgR3. The consensus sequence is listed below. The LRR NT 
domain is indicated by a shaded box; domains LLR 1, LLR 3, LLR 5, and LLR 7 are 
indicated by open boxes; LLR 2, LLR 4, LLR 6 and LLR 8 are indicated by shaded 
boxes; and the LLR CT domain is indicated by a shaded box. Amino acids in bold in 
LLR 8 indicate a conserved glycosylation sites. A dot indicates conserved cystine 
residue in LRR4. Box at C terminus indicates putative GPI signals. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides purified and isolated polynucleotides (e.g., 
DNA sequences and RNA transcripts, both sense and complementary antisense 
strands, both single- and double-stranded, including splice variants thereof) encoding 
NgR. homologs, referred to herein as NgR. Unless indicated otherwise, as used herein, 
the abbreviation in lower case (NgR) refers to a gene, cDNA, RNA or nucleic acid 
sequence, whereas the upper case version (NgR) refers to a protein, polypeptide, 
peptide, oligopeptide, or amino acid sequence. Specific proteins are designated by 
number, e.g., "NgR2" is a human NgR homolog, "NgR3" is a murine-derived NgR 
homolog, and "NgRl " is the known NgR identified by Dr. Stephen Strittmatter. 
Known NgRs are herein referred to as "NgRs." DNA polynucleotides of the invention 
include genomic DNA, cDNA and DNA that has been chemically synthesized in whole 
or in part. 



WO 02/29059 



PCT/US01/31488 



-8- 

Standard reference works setting forth the general principles of recombinant 
DNA technology known to those of skill in the art include Ausubel et al., CURRENT 
Protocols in Molecular Biology, John Wiley & Sons, New York (1998); 
Sambrook et aL, Molecular Cloning: A Laboratory Manual, 2d Ed., Cold 
5 Spring Harbor Laboratory Press, Plainview, New York (1989); Kaufinan et al y Eds., 
HANDBOOK OF MOLECULAR AND CELLULAR METHODS IN BIOLOGY AND MEDICINE, 

CRC Press, Boca Raton (1995); McPherson, Ed., DIRECTED MUTAGENESIS : A 
PRACTICAL APPROACH,.IRL Press, Oxford (1991). 

As used herein, the term "axon" refers to a long cellular protrusion from a 
10 neuron, whereby action potentials are conducted, either to or from the cell body. 

As used herein, the term "axonal growth" refers to an extension of the long 
process or axon, originating at the cell body and proceeded by the growth cone. 

As used herein, the term "central nervous system disorder" refers to any 
pathological state associated with abnormal function of the central nervous system 
15 (CNS). The term includes, but is not limited to, altered CNS function resulting from 
physical trauma to cerebral tissue, viral infection, autoimmune machanisms and genetic 
mutation. 

As used herein, the term "demyelinating disease" refers to a pathological 
disorder characterized by the degradation of the myelin sheath of the oligodendrocyte 
20 cell membrane. 

As used herein, the term "growth cone" refers to a specialized region at the tip 
of a growing neurite that is responsible for sensing the local environment and moving 
the axon toward its appropriate synaptic target cell. 

As used herein, the term "growth cone movement" refers to the extension or 
25 collapse of the growth cone toward a neuron's target cell. 

As used herein, the term "neurite" refers to a process growing out of a neuron. 
As it is sometimes difficult to distinguish a dendrite from in axon in culture, the term 
"neurite" is used for both. 

As used herein, the term "oligodendrocyte" refers to a neuroglial cell of the 
30 CNS whose function is to myelinate CNS axons. 

"Synthesized" as used herein and understood in the art, refers to 
polynucleotides produced by purely chemical, as opposed to enzymatic, methods. 
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"Wholly" synthesized DNA sequences are therefore produced entirely by chemical 
means, and "partially" synthesized DNAs embrace those wherein only portions of the 
resulting DNA were produced by chemical means. By the term "region" is meant a 
physically contiguous portion of the primary structure of a biomolecule. In the case of 
5 proteins, a region is defined by a contiguous portion of the amino acid sequence of that 
protein. The term "domain" is herein defined as referring to a structural part of a 
biomolecule that contributes to a known or suspected function of the biomolecule. 
Domains may be co-extensive with regions or portions thereof, domains may also 
incorporate a portion of a biomolecule that is distinct from a particular region, in 

10 addition to all or part of that region. Examples of NgR protein domains include, but 
are not limited to, the signal peptide, extracellular (i.e., N-terminal) domain, and 
leucine-rich repeat domains. 

As used herein, the term "activity" refers to a variety of measurable indicia 
suggesting or revealing binding, either direct or indirect; affecting a response, Le., 

15 having a measurable affect in response to some exposure or stimulus, including, for 
example, the affinity of a compound for directly binding a polypeptide or 
polynucleotide of the invention, or, for example, measurement of amounts of upstream 
or downstream proteins or other similar functions after some stimulus or event. Such 
activities may be measured by assays such as competitive inhibition of NgRl binding to 

20 Nogo assays wherein, for example, unlabeled, soluble NgR2 is added to an assay 

system in increasing concentrations to inhibit the binding of Nogo to NgRl expressed 
on the surface of CHO cells. As another example, one may assess the ability of 
neurons to extend across lesions caused by nerve damage (as in Schnell and Schwab 
(1990) Nature 343, 269-272) following inhibition of Nogo by various forms of NgR2 

25 and/or NgR3 as a biological indicator of NgR function. 

As used herein, the term "antibody" is meant to refer to complete, intact 
antibodies, and Fab, Fab', F(ab)2, and other fragments thereof Complete, intact 
antibodies include monoclonal antibodies such as murine monoclonal antibodies, 
chimeric antibodies, anti-idiotypic antibodies, anti-anti-idiotypic antibodies, and 

30 humanized antibodies. 

As used herein, the term "binding" means the physical or chemical interaction 
between two proteins or compounds or associated proteins or compounds or 
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combinations thereof. Binding includes ionic, non-ionic, hydrogen bonds, Van der 
Waals, hydrophobic interactions, etc. The physical interaction, the binding, can be 
either direct or indirect, indirect being through or due to the effects of another protein 
or compound. Direct binding refers to interactions that do not take place through or 
5 due to the effect of another protein or compound but instead are without other 
substantial chemical intermediates. 

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecules, peptides, proteins, sugars, 
nucleotides or nucleic acids, and such compound can be natural or synthetic. 

10 As used herein, the term "complementary" refers to Watson-Crick basepairing 

between nucleotide units of a nucleic acid molecule. 

As used herein, the term "contacting" means bringing together, either directly 
or indirectly, a compound into physical proximity to a polypeptide or polynucleotide of 
the invention. The polypeptide or polynucleotide can be in any number of buffers, 

15 salts, solutions etc. Contacting includes, for example, placing the compound into a 
beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the 
like, which contains the nucleic acid molecule, or polypeptide encoding the NgR or 
fragment thereof 

As used herein, the phrase "homologous nucleotide sequence," or "homologous 
20 amino acid sequence," or variations thereof, refers to sequences characterized by an 
identity at the nucleotide level, or a homology at the amino acid level, of at least the 
specified percentage. Homologous nucleotide sequences include those sequences 
coding for isoforms of proteins. Such isoforms can be expressed in different tissues of 
the same organism as a result of, for example, alternative splicing of RNA. 
25 Alternatively, isoforms can be encoded by different genes. Homologous nucleotide 
sequences include nucleotide sequences encoding for a protein of a species other than 
humans, including, but not limited to, mammals. Homologous nucleotide sequences 
also include, but are not limited to, naturally occurring allelic variations and mutations 
of the nucleotide sequences set forth herein. A homologous nucleotide sequence does 
30 not, however, include the nucleotide sequence encoding NgRl. Homologous amino 
acid sequences include those amino acid sequences which contain conservative amino 
acid substitutions and which polypeptides have the same binding and/or activity. A 
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homologous amino acid sequence does not, however, include the amino acid sequence 
encoding other known NgRs. Percent homology can be determined by, for example, 
the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, 
Genetics Computer Group, University Research Park, Madison WI), using the default 
5 settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 
482-489, which is incorporated herein by reference in its entirety). 

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that is substantially free of nucleic acids encoding other 
proteins with which it is associated in nature, i.e., a nucleic acid that has been removed 

10 from its native environment. Examples of isolated nucleic acid molecules include, but 
are not limited to, recombinant DNA molecules contained in a vector, recombinant 
DNA molecules maintained in a heterologous host cell, partially or substantially 
purified nucleic acid molecules, and synthetic DNA or RNA molecules. Preferably, an 
"isolated" nucleic acid is free of sequences which naturally flank the nucleic acid (i.e., 

1 5 sequences located at the 5 1 and 3 1 ends of the nucleic acid) in the genomic DNA of the 
organism from which the nucleic acid is derived. For example, in various 
embodiments, the isolated NgR nucleic acid molecule can contain less than about 50 
kb, 25 kb, 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 

20 nucleic acid is derived. Moreover, an "isolated" nucleic acid molecule, such as a 

cDNA molecule, can be substantially free of other cellular material or culture medium 
when produced by recombinant techniques, or of chemical precursors or other 
chemicals when chemically synthesized. 

As used herein, the term "heterologous" refers to a nucleotide or amino acid 

25 sequence that is a different, or non-corresponding sequence, or a sequence derived 

from a different species. For example, a mouse NgR nucleotide or amino acid sequence 
is heterologous to a human Ngjl nucleotide or amino acid sequence, and a human NgR 
nucleic or amino acid sequence is heterologous to a human immunoglobulin nucleotide 
or amino acid sequence. 

30 As used herein, a "soluble NgR polypeptide" is a NgR polypeptide that does 

not anchor itself in a membrane. Such soluble polypeptides include, for example, 
NgR2 and NgR3 polypeptides that lack a sufficient portion of their GPI anchor signal 
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to anchor the polypeptide or are modified such that the GPI anchor signal is not 
adequate to result in replacement of the peptide with a GPI anchor. In preferred 
embodiments, up to 5, 10, 20 or 25 amino acids are removed from the C-terminus of 
NgR2 or NgR3 to make the respective proteins soluble. As used herein soluble NgR 
5 polypeptides include full-length or truncated (e.g., with internal deletions) NgR. 

Soluble NgR polypeptides may include the entire NgR protein up to the 
putative GPI signal sequence (e.g., amino acid 1 to about amino acid 395 of NgR2, 
and from amino acid 1 to about amino acid 438 of NgR3). In other embodiments, the 
signal peptide of the proteins may be removed or truncated (e.g., all or part of the 
10 signal sequence of NgR2, which spans amino acid 1 to about amino acid 30 of SEQ ID 
NO:2, may be removed; all or part of the signal sequence of NgR3, which spans amino 
acid 1 to about amino acid 40 of SEQ ID NO:4, may be removed). In some 
embodiments, the mature NgR2 (SEQ ID NO: 8) and the mature NgR3 (SEQ ID 
NO:9) are used. 

1 5 Soluble NgR polypeptides include at least one of the putative ligand-binding 

portions of NgR, including the first cysteine-rich region (SEQ ID NO: 10, the leucine 
repeat region (SEQ ID NO: 12) and the second cysteine-rich region (SEQ ID NO: 1 1). 
In some embodiments, soluble NgR polypeptides consist of amino acid 1 through 
about amino acid 395 of SEQ ID NO. 2, or amino acid 1 through about amino acid 

20 438 ofSEQIDNO:4. 

In other embodiments, the soluble NgR polypeptides are fusion proteins that 
contain amino acids 30 through about amino acid 395 of mature NgR2 or amino acid 
40 through about amino acid 438 of NgR3, the C-terminal 10 amino acids of a human 
IgG 1 hinge region containing the two cysteine residues thought to participate in 

25 interchain disulfide bonding, and the CH2 and CH3 regions of a human IgGI heavy 
chain constant domain. This type of recombinant protein is designed to modulate 
inhibition of axonal elongation through inhibition of the Nogo ligand binding to NgRl, 
or by inhibiting the ligand of the NgR from interacting with cell surface NgR. The 
NgR portion of the fusion binds to the Nogo ligand and the IgGI portion binds to the 

30 FcyRI (macrophage) and Fcylll (NK cells and neutrophils) receptors. 

The production of the soluble polypeptides useful in this invention may be 
achieved by a variety of methods known in the art. For example, the polypeptides may 
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be derived from intact transmembrane NgR molecules by proteolysis using specific 
endopeptidases in combination with exopeptidases, Edman degradation, or both. The 
intact NgR molecule, in turn, may be purified from its natural source using 
conventional methods. Alternatively, the intact NgR may be produced by known 
5 recombinant DNA techniques using cDNAs, expression vectors and well-known 
techniques for recombinant gene expression, 

Preferably, the soluble polypeptides useful in the present invention are 
produced directly, thus eliminating the need for an entire NgR as a starting material. 
This may be achieved by conventional chemical synthesis techniques or by well-known 

10 recombinant DNA techniques wherein only those DNA sequences which encode the 
desired peptides are expressed in transformed hosts. For example, a gene which 
encodes the desired soluble NgR polypeptide may be synthesized by chemical means 
using an oligonucleotide synthesizer. Such oligonucleotides are designed based on the 
amino acid sequence of the desired soluble NgR polypeptide. Specific DNA sequences 

15 coding for the desired peptide also can be derived from the full- length DNA sequence 
by isolation of specific restriction endonuclease fragments or by PCR synthesis of the 
specified region from cDNA. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NOs: 1, 3 or a complement of either of 

20 these nucleotide sequences, can be isolated using standard molecular biology 

techniques and the sequence information provided herein. Using all or a portion of the 
nucleic acid sequences of SEQ ID NOs: 1 or 3 as a hybridization probe, NgR nucleic 
acid sequences can be isolated using standard hybridization and cloning techniques 
(e.g., as described in Sambrook et aL, eds., Molecular Cloning: aLaboratory 

25 MANUAL 2 nd Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 

1989; and Ausubei, etal, eds., CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John 
Wiley & Sons, New York, NY, 1993). 

A nucleic acid of the invention can be amplified using cDNA, mRNA or 
alternatively, genomic DNA, as a template and appropriate oligonucleotide primers 

30 according to standard PCR amplification techniques. The nucleic acid so amplified can 
be cloned into an appropriate vector and characterized by DNA sequence analysis. 
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Furthermore, oligonucleotides corresponding to NgR nucleotide sequences can be 
prepared by standard synthetic techniques, e.g., using an automated DNA synthesizer. 

As used herein, the terms "modulates" or "modifies" means an increase or 
decrease in the amount, quality, or effect of a particular activity or protein. 
5 As used herein, the term "oligonucleotide" refers to a series of linked 

nucleotide residues which has a sufficient number of bases to be used in a polymerase 
chain reaction (PCR). This short sequence is based on (or designed from) a genomic 
or cDNA sequence and is used to amplify, confirm or reveal the presence of an 
identical, similar or complementary DNA or &NA in a particular cell or tissue. 

10 Oligonucleotides comprise portions of a DNA sequence having at least about 10 
nucleotides and as many as about 50 nucleotides, preferably about 15 to 30 
nucleotides. They are chemically synthesized and may be used as probes. 

As used herein, the term "probe" refers to nucleic acid sequences of variable 
length, preferably between at least about 10 and as many as about 6,000 nucleotides, 

15 depending on use. They are used in the detection of identical, similar or 

complementary nucleic acid sequences. Longer length probes are usually obtained from 
a natural or recombinant source, are highly specific and much slower to hybridize than 
oligomers. They may be single- or double-stranded and carefully designed to have 
specificity in PCR, hybridization membrane-based, or ELIS A-like technologies. 

20 The term "preventing" refers to decreasing the probability that an organism 

contracts or develops an abnormal condition. 

The term "treating" refers to having a therapeutic effect and at least partially 
alleviating or abrogating an abnormal condition in the organism. 

The term "therapeutic effect" refers to the inhibition or activation factors 

25 causing or contributing to the abnormal condition. A therapeutic effect relieves to 
some extent one or more of the symptoms of the abnormal condition. In reference to 
the treatment of abnormal conditions, a therapeutic effect can refer to one or more of 
the following: (a) an increase in the proliferation, growth, and/or differentiation of 
cells; (b) inhibition (i.e., slowing or stopping) of cell death; (c) inhibition of 

30 degeneration; (d) relieving to some extent one or more of the symptoms associated 
with the abnormal condition; and (e) enhancing the function of the affected population 
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of cells. Compounds demonstrating efficacy against abnormal conditions can be 
identified as described herein. 

The term "abnormal condition" refers to a Sanction in the cells or tissues of an 
organism that deviates from their normal functions in that organism. An abnormal 
5 condition can relate to cell proliferation, cell differentiation, cell signaling, or cell 

survival. An abnormal condition may also include obesity, diabetic complications such 
as retinal degeneration, and irregularities in glucose uptake and metabolism, and fatty 
acid uptake and metabolism. 

Abnormal cell proliferative conditions, for example, include cancers such as 
10 fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, wound 
healing, psoriasis, diabetes mellitus and inflammation. 

Abnormal differentiation conditions include, for example, neurodegenerative 
disorders, slow wound healing rates and slow tissue grafting healing rates. 

Abnormal cell signaling conditions include, for example, psychiatric disorders 
1 5 involving excess neurotransmitter activity. 

Abnormal cell survival conditions may also relate to conditions in which 
programmed cell death (apoptosis) pathways are activated or abrogated. A number of 
protein kinases are associated with the apoptosis pathways. Aberrations in the 
function of any one of the protein kinases could lead to cell immortality or premature 
20 cell death. 

The term "administering" relates to a method of incorporating a compound into 
cells or tissues of an organism. The abnormal condition can be prevented or treated 
when the cells or tissues of the organism exist within the organism or outside of the 
organism. Cells existing outside the organism can be maintained or grown in cell 

25 culture dishes. For cells harbored within the organism, many techniques exist in the art 
to administer compounds, including (but not limited to) oral, parenteral, dermal, 
injection, and aerosol applications. For cells outside of the organism, multiple 
techniques exist in the art to administer the compounds, including (but not limited to) 
cell microinjection techniques, transformation techniques and carrier techniques. 

30 The abnormal condition can also be prevented or treated by administering a 

compound to a group of cells having an aberration in a signal transduction pathway to 
an organism. The effect of administering a compound on organism function can then 
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be monitored. The organism is preferably a mouse, rat, rabbit, guinea pig or goat, 
more preferably a monkey or ape, and most preferably a human. 

By "amplification" it is meant increased numbers of DNA or RNA in a cell 
compared with normal cells. "Amplification" as it refers to RNA can be the detectable 
5 presence of RNA in cells, since in some normal cells there is no basal expression of 
RNA. In other normal cells, a basal level of expression exists, therefore in these cases 
amplification is the detection of at least 1-2-fold, and preferably more, compared to 
the basal level. 

The amino acid sequences are presented in the amino to carboxy direction, 
10 from left to right. The amino and carboxy groups are not presented in the sequence. 
The nucleotide sequences are presented by single strand only, in the 5* to 3 1 direction, 
from left to right. Nucleotides and amino acids are represented in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission or (for 
amino acids) by three letters code. 

15 

Nucleic Acids 

Genomic DNA of the invention comprises the protein-coding region for a 
polypeptide of the invention and is also intended to include allelic variants thereof. It is 
widely understood that, for many genes, genomic DNA is transcribed into RNA 

20 transcripts that undergo one or more splicing events wherein intron (i.e., non-coding 
regions) of the transcripts are removed, or "spliced out." RNA transcripts that can be 
spliced by alternative mechanisms, and therefore be subject to removal of different 
RNA sequences but still encode a NgR polypeptide, are referred to in the art as splice 
variants which are embraced by the invention. Splice variants comprehended by the 

25 invention therefore are encoded by the same original genomic DNA sequences but 
arise from distinct mRNA transcripts. Allelic variants are modified forms of a 
wild-type gene sequence, the modification resulting from recombination during 
chromosomal segregation or exposure to conditions which give rise to genetic 
mutation. Allelic variants, like wild-type genes, are naturally occurring sequences (as 

30 opposed to non-naturally occurring variants arising from in vitro manipulation). 

The invention also comprehends cDNA that is obtained through reverse 
transcription of an RNA polynucleotide encoding NgR (conventionally followed by 
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second-strand synthesis of a complementary strand to provide a double-stranded 
DNA). 

Preferred DNA sequences encoding a human NgR polypeptide is set out in 
SEQ ID NOs: 1 and 13. A preferred DNA of the invention comprises a double 
5 stranded molecule comprising the coding molecule (i.e., the "coding strand") along 
with the complementary molecule (the "non-coding strand" or "complement") having a 
sequence unambiguously deducible from the coding strand according to Watson-Crick 
base-pairing rules for DNA Also preferred are other polynucleotides encoding NgR 
polypeptides, as shown in SEQ ID NO:3, which comprises murine NgR homolog, 
10 NgR3. 

Also preferred are nucleotide sequences that encode at least a portion of a NgR 
polypeptide that has at least one biological function of a NgR More preferred are 
nucleotide sequences that encode a portion of NgR that encodes at least the mature 
NgR without the hydrophobic C-terminal GPI signal. Also preferred are nucleotide 
15 sequences that encode the portion of NgR that encodes at least the ligand-binding 
region of NgR. 

The invention further embraces other species, preferably mammalian, homologs 
of the human NgR DNA, Species homologs, sometimes referred to as "orthologs," in 
general, share at least 35%, at least 40%, at least 45%, at least 50%, at least 60%, at 

20 least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 
95%, at least 98%, or at least 99% homology with human DNA of the invention. 
Generally, percent sequence "homology" with respect to polynucleotides of the 
invention may be calculated as the percentage of nucleotide bases in the candidate 
sequence that are identical to nucleotides in the NgR sequences set forth in SEQ ID 

25 NOs:l, 3 or 13, after aligning the sequences and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity. 

The polynucleotide sequence information provided by the invention makes 
possible large-scale expression of the encoded polypeptide by techniques well known 
and routinely practiced in the art. Polynucleotides of the invention also permit 

30 identification and isolation of polynucleotides encoding related NgR polypeptides, such 
as human allelic variants and species homologs, by well-known techniques including 
Southern and/or Northern hybridization, and polymerase chain reaction (PCR). 
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Examples of related polynucleotides include human and non-human genomic 
sequences, including allelic variants, as well as polynucleotides encoding polypeptides 
homologous to NgR and structurally related polypeptides sharing one or more 
biological, immunological, and/or physical properties of NgR. Non-human species 
5 genes encoding proteins homologous to NgR can also be identified by Southern and/or 
PCR analysis and are useful in animal models for NgR disorders. Knowledge of the 
sequence of a human NgR DNA also makes possible through use of Southern 
hybridization or polymerase chain reaction (PCR) the identification of genomic DNA 
sequences encoding NgR expression control regulatory sequences such as promoters, 

10 operators, enhancers, repressors, and the like. Polynucleotides of the invention are 
also useful in hybridization assays to detect the capacity of cells to express NgR 
Polynucleotides of the invention may also provide a basis for diagnostic methods useful 
for identifying a genetic alteration(s) in a NgR locus that underlies a disease state or 
states, which information is useful both for diagnosis and for selection of therapeutic 

15 strategies. 

The disclosure herein of a full-length polynucleotide encoding a NgR 
polypeptide makes readily available to the worker of ordinary skill in the art every 
possible fragment of the full-length polynucleotide. The invention, therefore, provides 
fragments of NgR-encoding polynucleotides comprising at least 6, and preferably at 

20 least 14, 16, 18, 20, 25, 50, or 75 consecutive nucleotides of a polynucleotide 

encoding NgR. Preferably, fragments of polynucleotides of the invention comprise 
sequences unique to the NgR-encoding polynucleotide sequence, and therefore 
hybridize under highly stringent or moderately stringent conditions only (i.e., 
"specifically") to polynucleotides encoding NgR (or fragments thereof). 

25 Polynucleotide fragments of genomic sequences of the invention comprise not only 
sequences unique to the coding region, but also include fragments of the full-length 
sequence derived from introns, regulatory regions, and/or other non-translated 
sequences. Sequences unique to polynucleotides of the invention are recognizable 
through sequence comparison to other known polynucleotides, and can be identified 

3 0 through use of alignment programs routinely utilized in the art, e.g. , those made 
available in public sequence databases. Such sequences also are recognizable from 
Southern hybridization analyses to determine the number of fragments of genomic 
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DNA to which a polynucleotide will hybridize. Polynucleotides of the invention can be 
labeled in a manner that permits their detection, including radioactive, fluorescent and 
enzymatic labeling. 

Fragments of polynucleotides are particularly useful as probes for detection of 

5 full-length or fragment of NgR polynucleotides. One or more polynucleotides can be 
included in kits that are used to detect the presence of a polynucleotide encoding NgR, 
or used to detect variations in a polynucleotide sequence encoding NgR. 

The invention also embraces DNAs encoding Ngjl polypeptides that hybridize 
under moderately stringent or high stringency conditions to the noncoding strand, or 

10 complement, of the polynucleotide in any of SEQ ID NOs: 1 or 3 . 

Stringent conditions are known to those skilled in the art and can be found in 
Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 
6.3.176.3.6. Preferably, the conditions are such that sequences at least about 65%, 
70%, 75%, 85%, 90%, 95%, 98% or 99% homologous to each other typically remain 

15 hybridized to each other. A non-limiting example of stringent hybridization conditions 
is hybridization in a high salt buffer comprising 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 
mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA and 500 mg/ml denatured salmon 
sperm DNA at 65°C. This hybridization is followed by one or more washes in 0.2X 
SSC, 0.01% BSA at 50'C. An isolated nucleic acid molecule of the invention that 

20 hybridizes under stringent conditions to the sequence of SEQ ID NOs: 1 or 3 
corresponds to a naturally occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an RNA or DNA molecule having 
a nucleotide sequence that occurs in nature (e.g., encodes a natural protein). 
As used herein, "stringent hybridization conditions" means: 42°C in a hybridization 

25 solution comprising 50% formamide, 1% SDS, 1 MNaCl, 10% (wt/vol) dextran 

sulfate, and washing twice for 30 minutes at 60°C in a wash solution comprising 0. 1 X 
SSC and 1% SDS. 

Vectors 

30 Another aspect of the present invention is directed to vectors, or recombinant 

expression vectors, comprising any of the nucleic acid molecules described above. 
Vectors are used herein either to amplify DNA or RNA encoding NgR and/or to 
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express DNA which encodes NgR. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments can be ligated. Another type of 
5 vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into 
which they are introduced (e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors). Other vectors (e.g., non-episomal 
mammalian vectors) are integrated into the genome of a host cell upon introduction 

10 into the host cell, and thereby are replicated along with the host genome. Moreover, 
certain vectors are capable of directing the expression of genes to which they are 
operatively linked. Such vectors are referred to herein as "expression vectors". In 
general, expression vectors of utility in recombinant DNA techniques are often in the 
form of plasmids. In the present specification, "plasmid" and "vector" can be used 

15 interchangeably as the plasmid is the most commonly used form of vector. However, 
the invention is intended to include such other forms of expression vectors, such as 
viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno- 
associated viruses), that serve equivalent functions. 

Expression of proteins in prokaryotes is most often carried out in R coli with 

20 vectors containing constitutive or inducible promoters directing the expression of 

either fusion or non-fixsion proteins. Fusion vectors add a number of amino acids to a 
protein encoded therein, usually to the amino terminus of the recombinant protein. 
Such fusion vectors typically serve three purposes: (1) to increase expression of 
recombinant protein; (2) to increase the solubility of the recombinant protein; and (3) 

25 to aid in the purification of the recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a proteolytic cleavage site is 
introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to 
purification of the fusion protein. Such enzymes, and their cognate recognition 

30 sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression 
vectors include pGEX (Pharmacia Biotech Inc; Smith and Johnson (1988) Gene 67, 
31-40), pMAL (New England Biolabs, Beverly, Mass.) and pRIT5 (Pharmacia, 
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Piscataway, N.J.) that fuse glutathione-S-transferase (GST), maltose E binding protein, 
or protein A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion 2i. coli expression vectors include 
pTrc (Amrann etal, (1988) Gene 69, 301-315) and pET lid (Studier etal, GENE 
5 Expression Technology: Methods in Enzymology 185, Academic Press, San 
Diego, CA. (1990) 60-89). 

One strategy to maximize recombinant protein expression in E. coli is to 
express the protein in host bacteria with an impaired capacity to proteolytically cleave 
the recombinant protein. See, Gottesman, Gene Expression Technology: 

10 Methods in Enzymology 185, Academic Press, San Diego, CA. (1990) 1 19-128. 
Another strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted 
into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized inE. coli (Wada etal, (1992) Nucleic Acids Res. 20, 2111- 
2118). Such alteration of nucleic acid sequences of the invention can be carried out by 

1 5 standard DNA synthesis techniques. 

In another embodiment, the NgR expression vector is a yeast expression 
vector. Examples of vectors for expression in yeast SL cerevisiae include pYepSecl 
(Baldari, et al, (1987) EMBO J. 6, 229-234), pMFa (Kurjan and Herskowitz (1982) 
O//30, 933-943), pJRY88 (Schultzef a/., (1987) Gene 54, 113-123), pYES2 

20 (Invitrogen Corporation, San Diego, CA), and picZ (InVitrogen Corp, San Diego, 
CA). 

Alternatively, NgR can be expressed in insect cells using baculovirus expression 
vectors. Baculovirus vectors available for expression of proteins in cultured insect 
cells (e.g., SF9 cells) include the pAc series (Smith etal, (1983)M>/. Cell Biol 3, 

25 2156-2165) and the pVL series (Lucklow and Summers (1989) Virology 170, 31-39). 
In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 329, 840) and pMT2PC 
(Kaufinanefa/. (1987) EMBO J. 6, 187-195). When used in mammalian cells, the 

30 expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
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prokaryotic and eukaryotic cells. See, e.g., Chapters 16 and 17 of Sambrook et al, 
(Eds.) Molecular Cloning: A Laboratory Manual. 2 nd Ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 
In another embodiment, the recombinant mammalian expression vector is 
5 capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
k tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al 
(1987) Genes Dev. 1, 268-277), lymphoid-specific promoters (Calame and Eaton 

10 (1988) Adv. Immunol 43, 235-275), in particular promoters of T cell receptors 

(Winoto and Baltimore (1989) EMBO J. 8, 729-733) and immunoglobulins (Banerji et 
al (1983) Cell 33, 729-740; Queen and Baltimore (1983) Cell 33, 741-748), neuron- 
specific promoters (e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl Acad. Set USA 86, 5473-5477), pancreas-specific promoters (Edlund etal. 

15 (1985) Science 230, 912-916), and mammary gland-specific promoters (e.g., milk 
whey promoter; U.S. Pat. No. 4,873,3 16 and European Application Publication No. 
264,166). Developmentally-regulated promoters are also encompassed, e.g., the 
murine hox promoters (Kessel and Gruss (1990) Science 249, 374-379) and the 
a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 3, 537-546). 

20 The invention further provides a recombinant expression vector comprising a 

DNA molecule of the invention cloned into the expression vector in an antisense 
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence 
in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense NgR mRNA. Regulatory sequences operatively linked 

25 to a nucleic acid cloned in the antisense orientation can be chosen that direct the 
continuous expression of the antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory sequences can be chosen that 
direct constitutive, tissue-specific or cell-type-specific expression of antisense RNA, 
The antisense expression vector can be in the form of a recombinant plasmid, 

30 phagemid or attenuated virus in which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the activity of which can be determined 
by the cell type into which the vector is introduced. For a discussion of the regulation 
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of gene expression using antisense genes see Weintraub et al, Antisense RNA as a 
molecular tool for genetic analysis, REVIEWS-TRENDS IN GENETICS, Vol. 1(1) 1986. 

Preferred vectors include, but are not limited to, plasmids, phages, cosmids, 
episomes, viral particles or viruses and integratable DNA fragments (i.e., fragments 
5 integratable into the host genome by homologous recombination). Preferred viral 
particles include, but are not limited to, adenoviruses, baculoviruses, parvoviruses, 
herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses, vaccinia 
viruses and retroviruses. Preferred expression vectors include, but are not limited to, 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors 

10 include, but are not limited to, pSPORT™ vectors, pGEM™ vectors (Promega), 

pPROEXvectors™ (LTI, Bethesda, MD), Bluescript™ vectors (Stratagene), pQE™ 
vectors (Qiagen), pSE420™ (Invitrogen) and pYES2™(Invitrogen). 

Preferred expression vectors are replicable DNA constructs in which a DNA 
sequence encoding NgR is operably linked or connected to suitable control sequences 

15 capable of effecting the expression of the NgR in a suitable host. DNA regions are 
operably linked or connected when they are functionally related to each other. For 
example, a promoter is operably linked or connected to a coding sequence if it controls 
the transcription of the sequence. Amplification vectors do not require expression 
control domains, but rather need only the ability to replicate in a host, usually 

20 conferred by an origin of replication, and a selection gene to facilitate recognition of 
transformants. The need for control sequences in the expression vector will vary 
depending upon the host selected and the transformation method chosen. Generally, 
control sequences include, but are not limited to a transcriptional promoter, enhancers, 
an optional operator sequence to control transcription, polyadenylation signals, a 

25 sequence encoding suitable mRNA ribosomal binding and sequences which control the 
termination of transcription and translation. Such regulatory sequences are described, 
for example, in Goeddel, Gene Expression Technology: Methods in 
ENZYMOLOGY 185, Academic Press, San Diego, CA (1990). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence in many types 

30 of host cell and those that direct expression of the nucleotide sequence only in certain 
host cells {e.g., tissue-specific regulatory sequences). It will be appreciated by those 
skilled in the art that the design of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level of expression of protein desired, 
etc. The expression vectors of the invention can be introduced into host cells to 
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by 
nucleic acids as described herein (e.g., NgR proteins, mutant forms of NgR, fusion 
5 proteins, etc.). 

Preferred vectors preferably contain a promoter that is recognized by the host 
organism. The promoter sequences of the present invention may be prokaryotic, 
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL 
promoters of bacteriophage lambda (The Bacteriophage Lambda, Hershey, A.D. 

10 (Ed.), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1973), which is 
incorporated herein by reference in its entirety; LAMBDA EE, Hendrix, R.W. (Ed.), Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1980), which is 
incorporated herein by reference in its entirety); the trp, recA, heat shock, and lacZ 
promoters of E. coli and the SV40 early promoter (Benoist et aL t (1981) Nature 290, 

1 5 304-3 10, which is incorporated herein by reference in its entirety). Additional 

promoters include, but are not limited to, mouse mammary tumor virus, long terminal 
repeat of human immunodeficiency virus, maloney virus, cytomegalovirus immediate 
early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human myosin, 
human hemoglobin, human muscle creatine and human metallothionein. 

20 Additional regulatory sequences can also be included in preferred vectors. 

Preferred examples of suitable regulatory sequences are represented by the 
Shine-Dalgarno sequence of the replicase gene of the phage MS-2 and of the gene ell 
of bacteriophage lambda. The Shine-Dalgarno sequence may be directly followed by 
DNA encoding NgR and result in the expression of the mature NgR protein. 

25 Moreover, suitable expression vectors can include an appropriate marker that 

allows the screening of the transformed host cells. The transformation of the selected 
host is carried out using any one of the various techniques well known to the expert in 
the art and described in Sambrook et al , supra. 

An origin of replication can also be provided either by construction of the 

30 vector to include an exogenous origin or may be provided by the host cell 

chromosomal replication mechanism. If the vector is integrated into the host cell 
chromosome, the latter may be sufficient. Alternatively, rather than using vectors 
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which contain viral origins of replication, one skilled in the art can transform 
mammalian cells by the method of co-transformation with a selectable marker and NgR 
DNA. An example of a suitable marker is dihydrofolate reductase (DHFR) or 
thymidine kinase {see, U.S. Patent No. 4,399,216). 
5 Nucleotide sequences encoding NgR may be recombined with vector DNA in 

accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction emyme digestion to provide appropriate termini, filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable 
joining and ligation with appropriate ligases. Techniques for such manipulation are 
10 disclosed by Sambrook et al , supra and are well known in the art. Methods for 

construction of mammalian expression vectors are disclosed in, for example, Okayama 
etal, (1983)M>/. Cell Biol 3:280, Cosman etal (1986) Mol Immunol 23:935, 
Cosman et al, (1984) Nature 3 12:768, EP-A-0367566, and WO 91/18982, each of 
which is incorporated herein by reference in its entirety. 

15 

Host Cells and Transformed Host Cells 

According to another aspect of the invention, host cells are provided, including 
prokaryotic and eukaryotic cells, comprising a polynucleotide of the invention (or 
vector of the invention) in a manner that permits expression of the encoded NgR 

20 polypeptide. Preferably, the cell produces little or no endogenous NgR polypeptide. 
Polynucleotides of the invention may be introduced into the host cell as part of a 
circular plasmid, or as linear DNA comprising an isolated protein coding region or a 
viral vector. Methods for introducing DNA into the host cell that are well known and 
routinely practiced in the art include transformation, transfection, electroporation, 

25 nuclear injection, or fusion with carriers such as liposomes, micelles, ghost cells and 
protoplasts. Expression systems of the invention include bacterial, yeast, fungal, plant, 
insect, invertebrate, vertebrate and mammalian cells systems. 

Host cells of the invention are a valuable source of immunogen for 
development of antibodies specifically immunoreactive with NgR. Host cells of the 

30 invention are also useful in methods for the large-scale production of NgR 

polypeptides wherein the cells are grown in a suitable culture medium and the desired 
polypeptide products are isolated from the cells, or from the medium in which the cells 
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are grown, by purification methods known in the art, e.g., conventional 
chromatographic methods including immunoaflBnity chromatography, receptor affinity 
chromatography, hydrophobic interaction chromatography, lectin affinity 
chromatography, size exclusion filtration, cation or anion exchange chromatography, 
5 high pressure liquid chromatography (HPLC), reverse phase HPLC, and the like. Still 
other methods of purification include those methods wherein the desired protein is . 
expressed and purified as a fusion protein having a specific tag, label or chelating 
moiety that is recognized by a specific binding partner or agent. The purified protein 
can be cleaved to yield the desired protein, or can be left as an intact fusion protein. 

10 Cleavage of the fusion component may produce a form of the desired protein having 
additional amino acid residues as a result of the cleavage process. 

Knowledge of NgRDNA sequences allows for modification of cells to permit, 
or increase, expression of endogenous NgR. Cells can be modified (e.g., by 
homologous recombination) to provide increased expression by replacing, in whole or 

15 in part, the naturally occurring NgR promoter with all or part of a heterologous 

promoter so that the cells express NgR at higher levels. The heterologous promoter is 
inserted in such a manner that it is operatively linked to endogenous NgR encoding 
sequences. (See, for example, PCT International Publication No. WO 94/12650, PCT 
International Publication No. WO 92/20808, and PCT International Publication No. 

20 WO 91/09955.) It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multiftinctional CAD gene 
which encodes carbamoyl phosphate synthase, aspartate transcarbamylase, and 
dihydroorotase) and/or intron DNA may be inserted along with the heterologous 
promoter DNA. If linked to the NgR coding sequence, amplification of the marker 

25 DNA by standard selection methods results in co-amplification of the NgR coding 
sequences in the cells. 

The DNA sequence information provided by the present invention also makes 
possible the development (e.g., by homologous recombination or "knock-out" 
strategies; see Capecchi, Science 244:1288-1292 (1989)) of animals that fail to express 

30 functional NgR or that express a variant of NgR. Such animals (especially small 

laboratory animals such as rats, rabbits and mice) are useful as models for studying the 
in vivo activities of NgR and modulators of NgR. 
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Suitable host cells for expression of the polypeptides of the invention include, 
but are not limited to, prokaryotes, yeast, and eukaryotes. If a prokaryotic expression 
vector is employed, then the appropriate host cell would be any prokaryotic cell 
capable of expressing the cloned sequences. Suitable prokaryotic cells include, but are 
5 not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, Pseudomonas, 
Streptomyces and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host cell 
would be any eukaryotic cell capable of expressing the cloned sequence. Preferably, 
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but 

10 are not limited to, non-human mammalian tissue culture cells and human tissue culture 
cells. Preferred host cells include, but are not limited to, insect cells, HeLa cells, 
Chinese hamster ovary cells (CHO cells), African green monkey kidney cells (COS 
cells), human 293 cells, and murine 3T3 fibroblasts. Propagation of such cells in cell 
culture has become a routine procedure {see, Tissue Culture, Academic Press, Kruse 

15 and Patterson, Eds. (1973), which is incorporated herein by reference in its entirety). 

In addition, a yeast cell may be employed as a host cell. Preferred yeast cells 
include, but are not limited to, the genera Saccharomyces, Pichia andKluveromyces. 
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can 
contain an origin of replication sequence from a 2T yeast plasmid, an autonomously 

20 replication sequence (ARS), a promoter region, sequences for polyadenylation, 

sequences for transcription termination and a selectable marker gene. Shuttle vectors 
for replication in both yeast and E. coli are also included herein. 

Alternatively, insect cells may be used as host cells. In a preferred 
embodiment, the polypeptides of the invention are expressed using a baculovirus 

25 expression system (see, Luckow et al, Bio/Technology, 1988, 6, 47; Baculovirus 
Expression Vectors: A Laboratory Manual, 0*Rielly et al (Eds.), W.H. 
Freeman and Company, New York, 1992; and U.S. Patent No. 4,879,236, each of 
which is incorporated herein by reference in its entirety). In addition, the MAXBAC™ 
complete baculovirus expression system (Invitrogen) can, for example, be used for 

30 production in insect cells. 

Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, CA 
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(1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 
polymerase. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
5 conventional transformation or transfection techniques. As used herein, the terms 

"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including 
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated 
transfection, lipofection, or electroporation. Suitable methods for transforming or 
10 transfecting host cells can be found in Sambrook, et al. (Molecular Cloning: A 

LABORATORY MANUAL. 2nd ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., 1989), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon 
the expression vector and transfection technique used, only a small fraction of cells 
1 5 may integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Various 
selectable markers include those that confer resistance to drugs, such as G418, 
hygromycin, dihydrofolate reductase (DHFR) and methotrexate. Nucleic acid 
20 encoding a selectable marker can be introduced into a host cell on the same vector as 
that encoding NgR or can be introduced on a separate vector. Cells stably transfected 
with the introduced nucleic acid can be identified by drug selection (e.g., cells that 
have incorporated the selectable marker gene will survive, while the other cells die). 
In a preferred embodiment, the polypeptides of the invention, including forms 
25 of NgR2 and NgR3, soluble forms of NgR, chimeric NgR polypeptides, NgMg 
fusions and fragments and variations of each of the above are expressed in Chinese 
Hamster Ovary (CHO) cells. 

In order to introduce the DNA fragment coding for the NgR protein or 
polypeptide into the CHO cell to express the recombinant NgR protein or polypeptide, 
30 it is necessary to construct the expression vector. 

The vectors for CHO expression include, but are not limited to, pAl-1 1, pXTl, 
pRc/CMV, pRc/RSV and pcDNAINeo. The promoter is not specifically limited 
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provided it effectively promotes expression in CHO cells. Examples of suitable 
promoters are: SRa, SV40, LTR, CMV, and HSV-TK. Of these, CMV and Sm 
promoters are preferred. 

In addition to the above-mentioned promoters, the expression vectors may 
5 contain enhancers, splicing signals, polyadenylation signals, selectable markers and an 
S V40 replication origin. Suitable selectable markers include, but are not limited to the 
dihydrofolate reductase (DHFR) gene which provides resistance to methotrexate 
(MTX), the ampicillin resistance gene, and the neomycin resistance gene. 

Examples of the expression vectors each containing the DNA coding for NgR, 

10 portions, fragments and soluble constructs thereof include the vector (such as one 
described above), into which the promoter is operably linked (preferably upstream) to 
the nucleotide sequence encoding the desired NgR construct; a polyadenylation signal 
downstream from the nucleotide sequence encoding the NgR construct; and, 
preferably, the vector includes an operable DHFR gene. Preferably, the ampicillin 

15 resistant gene is also operably contained in the vector. 

CHO cell lacking the DHFR gene (Urlaub, G. et al, (1980) Proc. Natl Acad 
Set USA 77, 4216-4220) and CHO-K1 (Proa Natl Acad. ScL USA 60, 1275 (1968)) 
are suitable for use. 

The NgR expression vectors prepared as above are introduced into CHO cells 

20 by any known method, including, but not limited to the calcium phosphate method 

(Graham and van der Eb (1973) Virol 52, 456-467) and electroporation (Nuemann et 
al, (1982) EMBOJ. 1, 841-845). 

Transformants carrying the expression vectors are selected based on the 
above-mentioned selectable markers. Repeated clonal selection of the transformants 

25 using the selectable markers allows selection of stable cell lines having high expression 
of the NgR constructs. Increased MTX concentrations in the selection medium allows 
gene amplification and greater expression of the desired protein. The CHO cell 
containing the recombinant NgR can be produced by cultivating the CHO cells 
containing the NR expression vectors constitutively expressing the NgR constructs. 

30 Media used in cultivating CHO cells includes DMEM medium supplemented 

with about 0.5 to 20% fetal calf serum, DMEM medium and RPMI1640 medium. The 
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pH of the medium is preferably about 6 to 8. Cultivation is preferably at about 30 to 
40 fl C for about 15 to 72 hours with aeration. 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (z'.e., express) NgR protein. Accordingly, the 
5 invention further provides methods for producing NgR protein using the host cells of 
the invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding NgR has been 
introduced) in a suitable medium such that NgR protein is produced. In another 
embodiment, the method further comprises isolating NgR from the medium or the host 
10 cell. 

In situations where the NgR polypeptide will be found primarily intracellularly, 
intracellular material (including inclusion bodies for Gram-negative bacteria) can be 
extracted from the host cell using any standard technique known to one of ordinary 
skill in the art. Such methods would encompass, by way of example and not by way of 

15 limitation, lysing the host cells to release the contents of the periplasm/cytoplasm by 
French press, homogenization and/or sonication followed by centrifugation. 

If the NgR polypeptide has formed inclusion bodies in the cytosol, such 
inclusion bodies may frequently bind to the inner and/or outer cellular membranes. 
Upon centrifugation, the inclusion bodies will be found primarily in the pellet material. 

20 The pellet material can then be treated at pH extremes or with one or more chaotropic 
agents such as a detergent, guanidine, guanidine derivatives, urea, or urea derivatives 
in the presence of a reducing agent such as dithiothreitol at alkaline pH or 
tris-carboxyethyl phosphine at acid pH to release, break apart and solubilize the 
inclusion bodies. Once solubilized, NgR polypeptide can be analyzed using gel 

25 electrophoresis, immunoprecipitation or the like. Various methods of isolating the 
NgR polypeptide would be apparent to one of ordinary skill in the art, for example, 
isolation may be accomplished using standard methods such as those set forth below 
and in Marston et al (1990) Meth Enzymol. 182, 264-275 (incorporated by reference 
herein in its entirety). 

30 If isolated NgR polypeptide is not biologically active following the isolation 

procedure employed, various methods for "refolding" or converting the polypeptide to 
its tertiary structure and generating disulfide linkages, can be used to restore biological 
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activity. Methods known to one of ordinary skill in the art include adjusting the pH of 
the solubilized polypeptide to a pH usually above 7 and in the presence of a particular 
concentration of a chaotrope. The selection of chaotrope is very similar to the choices 
used for inclusion body solubilization but usually at a lower concentration and is not 
5 necessarily the same chaotrope as used for the solubilization. It may be required to 
employ a reducing agent or the reducing agent plus its oxidized form in a specific ratio, 
to generate a particular redox potential allowing for disulfide shuffling to occur in the 
formation of the protein's cysteine bridge(s). Some of the commonly used redox 
couples include cysteine/cystamine, glutathione (GSH)/dithiobis GSH, cupric chloride, 
10 dithiothreitol (DTT)/dithiane DTT, 2-mercaptoethanol (bME)/dithio-b(ME). To 

increase the efficiency of the refolding, it may be necessary to employ a cosolvent, such 
as glycerol, polyethylene glycol of various molecular weights and arginine. 

Transgenic Animals 

15 The host cells of the invention can also be used to produce non-human 

transgenic animals. For example, in one embodiment, a host cell of the invention is a 
fertilized oocyte or an embryonic stem cell into which NgR-coding sequences have 
been introduced. Such host cells can then be used to create non-human transgenic 
animals in which exogenous NgR sequences have been introduced into their genome or 

20 homologous recombinant animals in which endogenous NgR sequences have been 
altered. Such animals are useful for studying the function and/or activity of NgR and 
for identifying and/or evaluating modulators of NgR activity. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes 

25 a transgene. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous DNA that is 
integrated into the genome of a cell from which a transgenic animal develops and that 
remains in the genome of the mature animal, thereby directing the expression of an 
encoded gene product in one or more cell types or tissues of the transgenic animal. As 

30 used herein, a "homologous recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous NgR gene has been altered 
by homologous recombination between the endogenous gene and an exogenous DNA 
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molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal, 
prior to development of the animal. 

A transgenic animal of the invention can be created by introducing NgR- 
encoding nucleic acid into the male pronuclei of a fertilized oocyte, e.g., by 
5 microinjection, retroviral infection, and allowing the oocyte to develop in a 

pseudopregnant female foster animal. The human NgR DNA sequence of SEQ ID 
NOs: 1 or 3 can be introduced as a transgene into the genome of a non-human animal. 
Alternatively, a nonhuman homolog of the human NgR gene, such as a mouse NgR 
gene, can be isolated based on hybridization to the human NgR cDNA (described 

10 further above) and used as a transgene. Intronic sequences and polyadenylation signals 
can also be included in the transgene to increase the efficiency of expression of the 
transgene. A tissue-specific regulatory sequence(s) can be operably linked to the NgR 
transgene to direct expression of NgR protein to particular cells. Methods for 
generating transgenic animals via embryo manipulation and microinjection, particularly 

1 5 animals such as mice, have become conventional in the art and are described, for 

example, in U.S. Pat. Nos. 4,736,866; 4,870,009; and 4,873,191; andHogan 1986, in 
Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY. Similar methods are used for production of other transgenic 
animals. A transgenic founder animal can be identified based upon the presence of the 

20 NgR transgene in its genome and/or expression of NgR mRNA in tissues or cells of the 
animals. A transgenic founder animal can then be used to breed additional animals 
carrying the transgene. Moreover, transgenic animals carrying a transgene encoding 
NgR can further be bred to other transgenic animals carrying other transgenes. 
' To create a homologous recombinant animal, a vector is prepared which 

25 contains at least a portion of a NgR gene into which a deletion, addition or substitution 
has been introduced to thereby alter, e.g., functionally disrupt, the NgR gene. The 
NgR gene can be a human gene (e.g., SEQ ID NOs:l or 13), but more preferably, is a 
non-human homolog of a human NgR gene. For example, a mouse homolog of human 
NgR gene of SEQ ID NOs: 1 or 13 can be used to construct a homologous 

30 recombination vector suitable for altering an endogenous NgR gene in the mouse 
genome. In one embodiment, the vector is designed such that, upon homologous 
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recombination, the endogenous NgR gene is functionally disrupted (i. e. , no longer 
encodes a functional protein; also referred to as a "knock out" vector). 

Alternatively, the vector can be designed such that, upon homologous 
recombination, the endogenous NgR gene is mutated or otherwise altered but still 
5 encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous NgR protein). In the homologous 
recombination vector, the altered portion of the NgR gene is flanked at its 5 1 and 3 1 
ends by additional nucleic acid of the NgR gene to allow for homologous 
recombination to occur between the exogenous NgR gene carried by the vector and an 

10 endogenous NgR gene in an embryonic stem cell. The additional flanking NgR nucleic 
acid is of sufficient length for successful homologous recombination with the 
endogenous gene. Typically, several kilobases of flanking DNA (both at the 5' and 3 1 
ends) are included in the vector. See e.g., Thomas et al (1987) Cell 51:503 for a 
description of homologous recombination vectors. The vector is introduced into an 

15 embryonic stem cell line {e.g., by electroporation) and cells in which the introduced 
NgR gene has homologously recombined with the endogenous NgR gene are selected 
(see e.g., Li et al. (1992) Cell 69:915). 

The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras. See e.g., Bradley 1987, In: 

20 Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, 

Robertson, ed. IRL, Oxford, pp. 1 13-152. A chimeric embryo can then be implanted 
into a suitable pseudopregnant female foster animal and the embryo brought to term. 
Progeny harboring the homologously recombined DNA in their germ cells can be used 
to breed animals in which all cells of the animal contain the homologously recombined 

25 DNA by germline transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous recombinant animals are 
described further in Bradley (1991) Curr. Opin. Biotechnol 2:823-829; PCT 
International Publication Nos.: WO 90/1 1354; WO 91/01140; WO 92/0968; and WO 
93/04169. 

30 In another embodiment, transgenic non-humans animals can be produced that 

contain selected systems that allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
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a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
Natl Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al (1991) 
Science 251:1351-1355. If a cre/loxP recombinase system is used to regulate 
5 expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such animals can be provided 
through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the 
other containing a transgene encoding a recombinase. 

10 Clones of the non-human transgenic animals described herein can also be 

produced according to the methods described in Wilmut et al (1997) Nature 385:810- 
813. In brief, a cell, e.g., a somatic cell, from the transgenic animal can be isolated and 
induced to exit the growth cycle and enter G 0 phase. The quiescent cell can then be 
fused, e.g., through the use of electrical pulses, to an enucleated oocyte from an animal 

15 of the same species from which the quiescent cell is isolated. The reconstructed 
oocyte is then cultured such that it develops to morula or blastocyte and then 
transferred to pseudopregnant female foster animal. The offspring borne of this female 
foster animal will be a clone of the animal from which the cell, e.g., the somatic cell, is 
isolated. 

20 

Antisense 

Also provided by the invention are antisense polynucleotides that recognize and 
hybridize to NgR polynucleotides. Full-length and fragment antisense polynucleotides 
are provided. Fragment antisense molecules of the invention include (i) those that 

25 specifically recognize and hybridize to NgR RNA (as determined by sequence 
comparison of DNA encoding NgR to DNA encoding other known molecules). 
Identification of sequences unique to NgR encoding polynucleotides can be deduced 
through use of any publicly available sequence database, and/or through use of 
commercially available sequence comparison programs. After identification of the 

30 desired sequences, isolation through restriction digestion or amplification using any of 
the various polymerase chain reaction techniques well known in the art can be 
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performed. Antisense polynucleotides are particularly relevant to regulating 
expression of NgR by those cells expressing NgR mRNA. 

Antisense oligonucleotides, or fragments of a nucleotide sequence set forth in 
SEQ ID NO:l, 3, 13 or sequences complementary or homologous thereto, derived 
5 from the nucleotide sequences of the present invention encoding NgR are useful as 
diagnostic tools for probing gene expression in various tissues. For example, tissue 
can be probed in situ with oligonucleotide probes carrying detectable groups by 
conventional autoradiography techniques to investigate native expression of this 
enzyme or pathological conditions relating thereto. In specific aspects, antisense 

10 nucleic acid molecules are provided that comprise a sequence complementary to at 

least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire NgR coding strand, or 
to only a portion thereof. Nucleic kcid molecules encoding fragments, homologs, 
derivatives and analogs of a NgR protein of SEQ ID NO:2, 4 or 14 or antisense 
nucleic acids complementary to a NgR nucleic acid sequence of SEQ ID NOs: 1, 3 or 

15 1 3 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding 
region" of the coding strand of a nucleotide sequence encoding NgR. The term 
"coding region" refers to the region of the nucleotide sequence comprising codons 
which are translated into amino acid residues (e.g., the protein coding region of human 

20 NgR corresponds to the coding region SEQ ID NO: 1, 3 or 13). In another 

embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence encoding NgR. The term "noncoding 
region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5* and 3' untranslated regions). 

25 Antisense oligonucleotides are preferably directed to regulatory regions of a 

nucleotide sequence of SEQ ID NO: 1, 3, 13 or mRNA corresponding thereto, 
including, but not limited to, the initiation codon, TATA box, enhancer sequences, and 
the like. Given the coding strand sequences encoding NgR disclosed herein (e.g., SEQ 
ID NO:l, 3 or 13), antisense nucleic acids of the invention can be designed according 

30 to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic 
acid molecule can be complementary to the entire coding region of NgR mRNA, but 
more preferably is an oligonucleotide that is antisense to only a portion of the coding 
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or noncoding region of NgR mRNA For example, the antisense oligonucleotide can 
be complementary to the region surrounding the translation start site of NgR mRNA. 
An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 
45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
5 constructed using chemical synthesis or enzymatic ligation reactions using procedures 
known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 
variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 

10 antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
substituted nucleotides can be used. 

Examples of modified nucleotides that can be used to generate the antisense 
nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, 
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5- 

15 carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil, 
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2- 
methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5- 
methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 

20 mannosylqueosine, 5-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 
N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3-(3 -amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6- 

25 diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically 
using an expression vector into which a nucleic acid has been subcloned in an antisense 
orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

30 The antisense nucleic acid molecules of the invention (preferably 

oligonucleotides of 10 to 20 nucleotides in length) are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA 
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and/or genomic DNA encoding a NgR protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. Suppression of NgR 
expression at either the transcriptional or translation^ level is useful to generate 
cellular or animal models for diseases/conditions characterized by aberrant NgR 
5 expression. The hybridization can be by conventional nucleotide complementarity to 
form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule 
that binds to DNA duplexes, through specific interactions in the major groove of the 
double helix. 

Phosphorothioate and methylphosphonate antisense oligonucleotides are 

10 specifically contemplated for therapeutic use by the invention. The antisense 
oligonucleotides may be further modified by adding poly-L~lysine, transferrin 
polylysine or cholesterol moieties at their 5' end. 

An example of a route of administration of antisense nucleic acid molecules of 
the invention includes direct injection at a tissue site. Alternatively, antisense nucleic 

1 5 acid molecules can be modified to target selected cells and then administered 

systemically. For example, for systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or antigens expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecules to peptides or 
antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid 

20 molecules can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong pol II 
or pol HI promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 

25 is an cc-anomeric nucleic acid molecule. An cc-anomeric nucleic acid molecule forms 

i 

specific double-stranded hybrids with complementary RNA in which, contrary to the 
usual p-units, the strands run parallel to each other (Gaultier et al 9 (1987) Nucleic 
Acids Res. 15, 6625-6641). The antisense nucleic acid molecule can also comprise a 
2'-o-methylribonucleotide (Inoue etal, (1987) Nucleic Acids Res. 15, 6131-6148) or a 
30 chimeric RNA-DNA analogue (Inoue et al, (1987) FEBS Lett. 215, 327-330). 

The NgR sequences taught in the present invention facilitate the design of 
novel transcription factors for modulating NgR expression in native cells and animals, 
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and cells transformed or transfected with NgR polynucleotides. For example, the 
Cys2-His2 zinc finger proteins, which bind DNA via their zinc finger domains, have 
been shown to be amenable to structural changes that lead to the recognition of 
different target sequences. These artificial zinc finger proteins recognize specific target 

5 sites with high affinity and low dissociation constants, and are able to act as gene 
switches to modulate gene expression. Knowledge of the particular NgR target 
sequence of the present invention facilitates the engineering of zinc finger proteins 
specific for the target sequence using known methods such as a combination of 
structure-based modeling and screening of phage display libraries (Segal et aL, (1999) 

10 Proc. Natl. Acad Sci. USA 96, 2758-2763; Liu et aL, (1997) Proc. Natl Acad. Sci. 

USA 94, 5525-5530; Greisman a/. (1997) Science 275, 657-661; Choo etaL, (1997) 
J. Mol Biol 273, 525-532). Each zinc finger domain usually recognizes three or more 
base pairs. Since a recognition sequence of 18 base pairs is generally sufficient in 
length to render it unique in any known genome, a zinc finger protein consisting of 6 

1 5 tandem repeats of zinc fingers would be expected to ensure specificity for a particular 
sequence (Segal et aL, (1999), above). The artificial zinc finger repeats, designed 
based on the promoter of NgR sequences, are fused to activation or repression 
domains to promote or suppress NgR expression (Liu et aL, (1997), above). The 
promoter of NgR may be obtained by standard methods known to one of ordinary skill 

20 in the art with the disclosure contained herein and knowledge of the NgR sequence. 
Alternatively, the zinc finger domains can be fused to the TATA box-binding factor 
(TBP) with varying lengths of linker region between the zinc finger peptide and the 
TBP to create either transcriptional activators or repressors (Kim et aL, (1997) Proc. 
Natl. Acad Sci. USA 94, 3616-3620. Such proteins and polynucleotides that encode 

25 them, have utility for modulating NgR expression in vivo in both native cells, animals 
and humans; and/or cells transfected with NgR-encoding sequences. The novel 
transcription factor can be delivered to the target cells by transfecting constructs that 
express the transcription factor (gene therapy), or by introducing the protein. 
Engineered zinc finger proteins can also be designed to bind RNA sequences for use in 

30 therapeutics as alternatives to antisense or catalytic RNA methods (McColl et aL, 
(1997) Proc. Natl Acad. Sci. USA 96, 9521-9526); Wu et aL, (1995) Proc. Natl. 
Acad Sci. USA 92, 344-348). The present invention contemplates methods of 
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designing such transcription factors based on the gene sequence of the invention, as 
well as customized zinc finger proteins, that are useful to modulate NgR expression in 
cells (native or transformed) whose genetic complement includes these sequences. 

5 Ribozymes and PNA moieties 

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they 
have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes, 

10 described in Haselhoff and Gerlach (1988) Nature 334, 585-591) can be used to 
catalytically cleave NgR mRNA transcripts to thereby inhibit translation of NgR 
mRNA. A ribozyme having specificity for a NgR-encoding nucleic acid can be 
designed based upon the nucleotide sequence of aNgRDNA disclosed herein (i.e., 
SEQ ID NOs:l, 3 or 13). For example, a derivative of a Tetrahymena L-19 IVS RNA 

1 5 can be constructed in which the nucleotide sequence of the active site is 

complementary to the nucleotide sequence to be cleaved in a NgR-encoding mRNA. 
See, e.g., Cech et al. U.S. Patent No. 4,987,071; and Cech et al U.S. Patent No. 
5, 1 16,742, Alternatively, NgR mRNA can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel et al, 

20 (1993) Science 261, 1411-1418. 

Alternatively, NgR gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of the NgR (e.g. , the NgR 
promoter and/or enhancers) to form triple helical structures that prevent transcription 
of the NgR gene in target cells. See generally, Helene (1991) Anticancer Drug Des. 6: 

25 569-584; Helene. et al., (1992) Ann. N. Y. Acad Sci. 660:27-36; and Maher (1992) 
BioEssayslA, 807-815. 

In various embodiments, the nucleic acids of NgR can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, 
hybridization, or solubility of the molecule. For example, the deoxyribose phosphate 

30 backbone of the nucleic acids can be modified to generate peptide nucleic acids (see 
Hyrup et al, (1996) Bioorg. Med Chem. Lett. 4, 5-23). As used herein, the terms 
"peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in 
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which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone 
and only the four natural nucleobases are retained. The neutral backbone of PNAs has 
been shown to allow for specific hybridization to DNA and RNA under conditions of 
low ionic strength. The synthesis of PNA oligomers can be performed using standard 
5 solid phase peptide synthesis protocols as described in Hyrup et al, (1996) above; 
Perry-OKeefe et al, (1996) Proc. Natl Acad Sci. USA 93,14670-14675. 

PNAs of NgR can be used in therapeutic and diagnostic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing transcription or translation arrest or 

10 inhibiting replication. PNAs of NgR can also be used, e.g., in the analysis of single 
base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial 
restriction en2ymes when used in combination with other en2ymes, e.g., SI nucleases 
(Hyrup (1996), above); or as probes or primers for DNA sequence and hybridization 
(Hyrup eta!., (1996), above; Perry-O'Keefe (1996), above). 

15 In another embodiment, PNAs of NgR can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by 
the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques 
of drug delivery known in the art. For example, PNA-DNA chimeras of NgR can be 
generated that may combine the advantageous properties of PNA and DNA. Such 

20 chimeras allow DNA recognition enzymes, e.g. , RNase H and DNA polymerases, to 
interact with the DNA portion while the PNA portion would provide high binding 
affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate 
lengths selected in terms of base stacking, number of bonds between the nucleobases, 
and orientation (Hyrup (1996), above). The synthesis of PNA-DNA chimeras can be 

25 performed as described in Hyrup (1996), above and Finn et al. (1996) Nucleic Acids 
Res. 24, 3357-3363. For example, a DNA chain can be synthesized on a solid support 
using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, 
e.g., 5-(4-methoxytrityl) amino-5'-deoxy-thymidine phosphoramidite, can be used 
between the PNA and the 5' end of DNA (Mag et al. (1989) Nucleic Acids Res. 17, 

30 973-988). PNA monomers are then coupled in a stepwise manner to produce a 

chimeric molecule with a 5' PNA segment and a 3 1 DNA segment (Finn et al. (1996), 
above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment 
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and a 3' PNA segment. See, Petersen et al (1975) Bioorg. Med. Chem. Lett. 5:1119- 
1124. 

In other embodiments, the oligonucleotide may include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
5 transport across the cell membrane (see Letsinger et al, (1989) Proc. Natl Acad Sci. 
USA 86, 6553-6556; Lemaitre etal, (1987) Proc. Natl Acad Sci. USA 84, 648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization triggered cleavage agents (see, e.g., Krol et al, (1988) Biotechniques 6, 

10 958-976) or intercalating agents (see, e.g., Zon (1988) Pharm. Res. 5, 539-549). To 
this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a 
hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
cleavage agent, etc. 

Automated sequencing methods can be used to obtain or verify the nucleotide 

15 sequence of NgR. The NgR nucleotide sequences of the present invention are believed 
to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at 
least about 99.9% identical to the actual nucleotide sequence of a given nucleic acid 

20 molecule. The actual sequence may be more precisely determined using manual 

sequencing methods, which are well known in the art. An error in a sequence which 
results in an insertion or deletion of one or more nucleotides may result in a frame shift 
in translation such that the predicted amino acid sequence will differ from that which 
would be predicted from the actual nucleotide sequence of the nucleic acid molecule, 

25 starting at the point of the mutation. 



Polypeptides 

The invention also provides purified and isolated mammalian NgR polypeptides 
encoded by a polynucleotide of the invention. Presently preferred is a human NgR 
30 polypeptide comprising the amino acid sequence set forth in SEQ ID NO:2 or SEQ ID 
NO: 14. Another preferred embodiment is a mouse NgR polypeptide comprising the 
amino acid sequence of NgR3, as set forth in SEQ ID NO:4. 
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One aspect of the invention pertains to isolated NgR proteins, and biologically 
active portions thereof, or derivatives, fragments, analogs or homologs thereof. Also 
provided are polypeptide fragments suitable for use as immunogens to raise anti-NgR 
antibodies. Preferably, fragments of NgR proteins comprise at least one biological 
5 activity of NgR. In one embodiment, native NgR proteins can be isolated from cells or 
tissue sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, NgR proteins are produced by recombinant DNA 
techniques. Alternative to recombinant expression, a NgR protein or polypeptide can 
be synthesized chemically using standard peptide synthesis techniques. 

10 The invention also embraces polypeptides that have at least 99%, at least 95%, 

at least 90%, at least 85%, at least 80%, at least 75%, at least 70%, at least 65%, at 
least 60%, at least 55%, at least 50% or at least 45% identity and/or homology to the 
preferred polypeptide of the invention. In addition, the invention embraces 
polypeptides having the consensus sequence shown in SEQ ID NO: 6, shown in Table 

15 5) excluding the previously characterized NgR ("NgRl "), and polypeptides comprising 
at least about 90% of the consensus sequence. 

The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over that region of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I, in the case 

20 of nucleic acids) occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the region 
of comparison (i.e., the window size), and multiplying the result by 100 to yield the 
percentage of sequence identity. The term "substantial identity" as used herein denotes 
a characteristic of a polynucleotide sequence, wherein the polynucleotide comprises a 

25 sequence that has at least 80 percent sequence identity, preferably at least 85 percent 
identity and often 90 to 95 percent sequence identity, more usually at least 99 percent 
sequence identity as compared to a reference sequence over a comparison region. 

In one aspect, percent homology is calculated as the percentage of amino acid 
residues in the smaller of two sequences which align with identical amino acid residue 

30 in the sequence being compared, when four gaps in a length of 100 amino acids may be 
introduced to maximize alignment (Dayhoff, in ATLAS OF PROTEIN SEQUENCE AND 
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Structure, Vol. 5, p. 124, National Biochemical Research Foundation, Washington, 
D.C. (1972), incorporated herein by reference). 

A determination of homology or identity is typically made by a computer 
homology program known in the art. An exemplary program is the Gap program 
5 (Wisconsin Sequence Analysis Package, Version 8 for UNIX, Genetics Computer 
Group, University Research Park, Madison, WI) using the default settings, which uses 
the algorithm of Smith and Waterman {Adv. Appl Math., 1981, 2, 482-489, which in 
incorporated herein by reference in its entirety). Employing the GAP software 
provided in the GCG program package, (see Needleman and Wunsch (1970) J. Mol 

10 Biol 48, 443-453) the following settings for nucleic acid sequence comparison may be 
used: GAP creation penalty of 5.0 and GAP extension penalty of 0.3, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 
identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99%, with 
the CDS (encoding) part of the DNA sequence shown in SEQ ID NOs:l, 3 or 13. 

1 5 BestFit was originally written for Version 1 .0 by Paul Haeberli from a careful reading 
of the papers by Needleman and Wunsch (1970), above, and Smith and Waterman 
(1981), above. The following Bestfit settings for nucleic acid sequence comparison 
may be used: GAP creation penalty of 8.0 and GAP extension penalty of 2, the coding 
region of the analogous nucleic acid sequences referred to above exhibits a degree of 

20 identity preferably of at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99%, with the 
CDS (encoding) part of the amino acid sequence shown in SEQ ID NOs:2, 4 or 14. 

Alternatively, homology may be determined by hybridization analysis wherein a 
nucleic acid sequence is hybridized to the complement of a sequence encoding the 
aforementioned proteins under stringent, moderately stringent, or low stringent 

25 conditions. See e.g. Ausubel, et al , (Eds.) CURRENT PRpTOCOLS IN MOLECULAR 
Biology, John Wiley & Sons, New York, NY, 1993, and below. 

Polypeptides of the invention may be isolated from natural cell sources or may 
be chemically synthesized, but are preferably produced by recombinant procedures 
involving host cells of the invention. 

30 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the NgR protein is derived, or substantially free from 
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chemical precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of NgR protein in which 
the protein is separated from cellular components of the cells from which it is isolated 
or recombinantly produced. In one embodiment, the language "substantially free of 

5 cellular material" includes preparations of NgR protein having less than about 30% (by 
dry weight) of non-NgR protein (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-NgR protein, still more preferably less 
than about 10% of non-NgR protein, and most preferably less than about 5% non-NgR 
protein. When the NgR protein or biologically active portion thereof is recombinantly 

10 produced, it is also preferably substantially free of culture medium, r.e., culture 

medium represents less than about 20%, more preferably less than about 10%, and 
most preferably less than about 5% of the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" 
includes preparations of NgR protein in which the protein is separated from chemical 

1 5 precursors or other chemicals that are involved in the synthesis of the protein. In one 
embodiment, the language "substantially free of chemical precursors or other 
chemicals" includes preparations of NgR protein having less than about 30% (by dry 
weight) of chemical precursors or non-NgR chemicals, more preferably less than about 
20% chemical precursors or non-NgR chemicals, still more preferably less than about 

20 10% chemical precursors or non-NgR chemicals, and most preferably less than about 
5% chemical precursors or non-NgR chemicals. 

Biologically active portions of a NgR protein include peptides comprising 
amino acid sequences sufficiently homologous to or derived from the amino acid 
sequence of the NgR protein, e.g., the amino acid sequence shown in SEQ ID NO:2, 4 

25 or 14 that include fewer amino adds than the full length NgR proteins, and exhibit at 
least one activity of a NgR protein. Typically, biologically active portions comprise a 
domain or motif with at least one activity of the NgR protein. A biologically active 
portion of a NgR protein can be a polypeptide which is, for example, 10, 25, 50, 100 
or more amino acids in length. 

30 A biologically active portion of a NgR protein of the present invention may 

contain at least one of the features that is conserved between the NgR proteins (e.g., a 
conserved cysteine as the N-terminus of the mature protein, four conserved cysteines 
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in the N-terminus before a leucine-rich region, four conserved cysteines C-terminal 
with respect to a leucine repeat region, eight leucine-rich repeats, and a hydrophobic 
C-terminus). An alternative biologically active portion of a NgR protein may contain 
at least two of the above-identified domains. Another biologically active portion of a 
5 NgR protein may contain at least three of the above-identified domains. Yet another 
biologically active portion of a NgR protein of the present invention may contain at 
least four of the above-identified domains. 

Moreover, other biologically active portions, in which other regions of the 
protein are deleted, can be prepared by recombinant techniques and evaluated for one 

10 or more of the functional activities of a native NgR protein. 

In an embodiment, the NgR protein has an amino acid sequence shown in SEQ 
ID NO:2, 4 or 14. In other embodiments, the NgR protein is substantially homologous 
to SEQ ID NO:2, 4 or 14 and retains the functional activity of the protein of SEQ ID 
NO:2, 4 or 14, yet differs in amino acid sequence due to natural allelic variation or 

15 mutagenesis, as described in detail below. 

Accordingly, in another embodiment, the NgR protein is a protein that 
comprises an amino acid sequence at least about 45% homologous to the amino acid 
sequence of SEQ ID NO:2 or SEQ ID NO:4 or SEQ ID NO: 14 and retains the 
functional activity of the NgR proteins of SEQ ID NO:2, 4 or 14. 

20 Use of mammalian host cells is expected to provide for such post-translational 

modifications (e.g., glycosylation, truncation, lipidation and phosphorylation) as may 
be needed to confer optimal biological activity on recombinant expression products of 
. the invention. Glycosylated and non-glycosylated forms of NgR polypeptides are 
embraced by the invention. 

25 The invention also embraces variant (or analog) NgR polypeptides. In one 

example, insertion variants are provided wherein one or more amino acid residues 
supplement a NgR amino acid sequence. Insertions may be located at either or both 
termini of the protein, or may be positioned within internal regions of the NgR amino 
acid sequence. Insertional variants with additional residues at either or both termini 

30 can include, for example, fusion proteins and proteins including amino acid tags or 
labels. 
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Insertion variants include NgR polypeptides wherein one or more amino acid 
residues are added to a NgR acid sequence or to a biologically active fragment thereof. 

Variant products of the invention also include mature NgR products, i.e., NgR 
products wherein leader or signal sequences are removed, with additional amino 
5 terminal residues. The additional amino terminal residues may be derived from another 
protein, or may include one or more residues that are not identifiable as being derived 
from specific proteins. NgR products with an additional methionine residue at 
position -1 (Met -1 -NgR) are contemplated, as are variants with additional methionine 
and lysine residues at positions -2 and -1 (Met^-Lys^-NgR). Variants of NgR with 
10 additional Met, Met-Lys, Lys residues (or one or more basic residues in general) are 
particularly useful for enhanced recombinant protein production in bacterial host cells. 

Polypeptide Variants 

The invention also embraces NgR variants having additional amino acid 

15 residues which result from use of specific expression systems. 

As used herein, a NgR "chimeric protein" or "fusion protein" comprises a NgR 
polypeptide operatively linked to a non-NgR polypeptide. A ,f NgR polypeptide" refers 
to a polypeptide having an amino acid sequence corresponding to NgR, whereas a 
"non-NgR polypeptide" refers to a polypeptide having an amino acid sequence 

20 corresponding to a protein that is not homologous to the NgR protein, e.g., a protein 
that is different from the NgR protein and that is derived from the same or a different 
organism. Within a NgR fusion protein the NgR polypeptide can correspond to all or a 
portion of a NgR protein. In one embodiment, a NgR fusion protein comprises at least 
one biologically active portion of a NgR protein. In another embodiment, a NgR 

25 fusion protein comprises at least two biologically active portions of a NgR protein. In 
yet another embodiment, a NgR fusion protein comprises at least three biologically 
active portions of a NgR protein. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the NgR polypeptide and the non-NgR polypeptide 
are fused in-frame to each other. The non-NgR polypeptide can be fused to the N- 

30 terminus or C-terminus of the NgR polypeptide. 

For example, in one embodiment a NgR fusion protein comprises a NgR 
domain operably linked to the extracellular domain of a second protein. Such fusion 
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proteins can be further utilized in screening assays for compounds which modulate 
NgR activity (such assays are described in detail below). 

For example, use of commercially available vectors that express a desired 
polypeptide as part of a glutathione-S-transferase (GST) fusion product provides the 
5 desired polypeptide having an additional glycine residue at position -1 after cleavage of 
the GST component from the desired polypeptide. 

In another embodiment, the fusion protein is a NgR protein containing a 
heterologous signal sequence at its N-terminus. For example, the native NgR signal 
sequence (i.e., amino acids 1-30 of SEQ ID NO:2 and amino acids 1-40 of SEQ ID 

10 NO:4) can be removed and replaced with a signal sequence from another protein. In 
certain host cells (e.g., mammalian host cells), expression and/or secretion NgR can be 
increased through use of a heterologous signal sequence. 

In yet another embodiment, the fusion protein is a NgR-immunoglobulin fusion 
protein in which the NgR sequences comprising one or more domains are fused to 

1 5 sequences derived from a member of the immunoglobulin protein family. The NgR- 
immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between NgR ligand and a NgR protein on the surface of a cell, to thereby suppress 
NgR-mediated signal transduction in vivo. NgR-immunoglobulin fusion proteins can be 

20 used to affect the bioavailability of a NgR cognate ligand. Inhibition of the NgR 
ligand/NgR interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the NgR-immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-NgR antibodies in a subject, to 

25 purify NgR ligands, and in screening assays to identify molecules that inhibit the 
interaction of NgR with NgR ligand. 

A NgR chimeric or fusion protein of the invention can be produced by standard 
recombinant DNA techniques. For example, DNA fragments coding for the different 
polypeptide sequences are ligated together in-frame in accordance with conventional 

30 techniques, e.g. , by employing blunt-ended or stagger-ended termini for ligation, 

restriction en2yme digestion to provide for appropriate termini, filling-in of cohesive 
ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining and 
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enzymatic ligation. In another embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA synthesizers. Alternatively, PCR 
amplification of gene fragments can be carried out using anchor primers that give rise 
to complementary overhangs between two consecutive gene fragments that can 
5 subsequently be annealed and reamplified to generate a chimeric gene sequence (see, 
for example, Ausubel et al (Eds.) Current Protocols in Molecular Biology, 
John Wiley & Sons, 1992). Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide). A NgR- 
encoding nucleic acid can be cloned into such an expression vector such that the fusion 
10 moiety is linked in-frame to the NgR protein. 

Variants resulting from expression in other vector systems are also 
contemplated. 

Insertional variants also include fusion proteins wherein the amino terminus 
and/or the carboxy terminus of NgR is/are fused to another polypeptide. 

15 In another aspect, the invention provides deletion variants wherein one or more 

amino acid residues in a NgR polypeptide are removed. Deletions can be effected at 
one or both termini of the NgR polypeptide, or with removal of one or more 
non-terminal amino acid residues of NgR. Deletion variants, therefore, include all 
fragments of a NgR polypeptide. 

20 The invention also embraces polypeptide fragments of the sequence set forth in 

SEQ ID NO:2, 4 or 14 wherein the fragments maintain biological (e.g., ligand binding 
and/or intracellular signaling) immunological properties of a NgR polypeptide. 
Fragments comprising at least 4, 5, 10, 15, 20, 25, 30, 35, or 40 consecutive amino 
acids of SEQ ID NO:2, 4 or 14 are contemplated by the invention. Preferred 

25 polypeptide fragments display antigenic properties unique to, or specific for, human 
NgR and its allelic and species homologs. Fragments of the invention having the 
desired biological and immunological properties can be prepared by any of the methods 
well known and routinely practiced in the art. 

In still another aspect, the invention provides substitution variants of NgR 

30 polypeptides. Substitution variants include those polypeptides wherein one or more 
amino acid residues of a NgR polypeptide are removed and replaced with alternative 
residues. In one aspect, the substitutions are conservative in nature; however, the 
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invention embraces substitutions that are also non-conservative. Conservative 
substitutions for this purpose may be defined as set out in Tables 2, 3, or 4 below. 



Table 1. 





Column I 


Column II 


(based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 








G, R, M 




X, 


A,D,C 




x 3 


V,T 




X* 


N,P,S 






E,A,S 




Xfi 


nothing, K 


nothing 


x 7 


V,M,P 




x, 


T, V 


V 


x* 


Q,P 


Q 




Q, A 


Q 


x„ 


Q.H.N 




X 12 


G,N 


N 


x 13 


L,F 


F 


X 14 


Q,A,S 




X 15 


A.S 




x 16 


v,i 




X 17 


V,T,E,L 




X 18 


S,G 




X 19 


L,I 




X20 


A,E,V,P 




X* 


A,S,D 




X22 


S,T 






Q,E 
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i^oiumn x 


i^oiumn u 


(based on a NTLRRCT 




fR2+R3 onlvt 


domain) 






24 


ivl 




X™ 


Q,H 


Q 




N.G 


N 


27 


RX 




X28 


T,G,R,S 




X29 


F,L,T,H 




X30 


L,V 


L 




Q.RJP 




X32 


Q,P,A 


P 


33 


G,A 


G 




H,T,S 




x,« 

35 


S,G,R 




x« 

36 


RS A 




x, 7 


C, nothing 


nothing 


X38 


R, nothing 


nothing 


X39 


AN 




X40 


ML 




X,, 


V.L.T 




X42 


*> * 


T 


x« 

^M3 


LI 




X^ 

' x 44 


YFH 




x« 

45 


N,V 

> * 


N 


x 46 


LL 




X47 


T,S,A 




X48 


F,Y,T,R 




X49 


A,H,Y,D 
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column 1 


column u 




fRl "R2 R3i 

^XVAj IVirj XX»/y 


fR2+R3 onlvi 


domain) 






X50 


P,A 


p 


51 


N.S.G.A 






T A 


T 


x« 


ER,T 






GH 




^55 


FL 






VO,H 










Xjg 


E,Q 


E 


X59 


GS 


G 


Xc„ 


R,A j 


R 






0 


X52 


R,H 


H 


x« 


T,S 








L 


x*< 


A ED 




x* 


EDA 






OH 


Q 


Xg 8 


VEG 




X59 


K,R 




X70 






x 71 

71 






X 72 


Y,H 




X73 


Y,D 


Y 


X74 


K,R 




x 73 


G,Q 
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Column 1 


column ii 


fhacpH nn a NTTUUPT 

^UaSvU UU a ll J. J_jXVXVV^ X 


nil m "R3i ' 


rR2+1H nnlvi 

1 JEV&r 1 M\aJ Ullljr ^ 


domain^ 

wyui Will m 






Y 
^76 


s 0 


s 


Xr, 


A,SE 




X73 


PG 


p 


X79 


A,GP 






G,N 






I,V,L 




^82 


G,R 




^83 


KV A 

xx, v j-rx 




X™ 


S A 


s 


Xg5 


DE 




x»< 


H,S,A 




yv 87 


IX 

X,XV 




Xpp 
^88 


ELO 






YHLA 

x 




^*90 


OP 


0 


X51 


D,N 




'*92 


I L T 

X, X^j X 




Xo, 
**93 


V AR 




Xo. 


V AG 

V j^x,VJ 






S T 


S i 


^6 


K,R 






LI 


L 


^98 


W,R,S 






S,L 




X 100 


L,V 


L 


X101 


G,T,P 





WO 02/29059 



PCT/US01/31488 



-53- 





column i 


column ii 


Chased on a NTLRRCT 


fRl R2. R31 


fR2+R3 nnM 

ll**< 1 1W If 111 T f 


domain) 






X 

102 


Q,P,E 




x 

"103 






x 

■'104 


LT,V,A 

5 J 5 




x 

"105 


V,G,H 




x 

"106 


N,S 




x 

"107 


E,G,Q 




x 

■*M08 






x 

"109 


L,V 




x 

"110 


0,A 






WG,H 




x 

"112 


BLR.P 




x 

"113 


K,A,H 




x 

"114 


H,R 




x 

'Ml 5 


D,G 




^116 


HJELS.G 




x 

"117 


T,M 




x 

"■118 


T,I 




x 

"119 


F,Y 

> 




Y 

'M20 


N,A 




^121 


S,N 




x 

"122 


T A S 




x 

"123 


E,S,A 




X124 


Q,P 




^125 


G,T 




^126 


D,E 


D 


^127 


CA 
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f nlnmn T 


fftliiTtin XT 


(based on a NTLRRCT 


(Rl. R2, R3) 


(R2+R3 only) 


domain) 






X128 


P,D 




Xj29 


V,G,P,R 




X130 


A,S 






E,Q 


Q 


Xi-ao 


F,Y 


F 


Xj33 


GAD 




Xi-a4 


A.P 




X135 


D.A.V 




X136 


G,D 




Xj37 


A,E 




X138 


S,P 




X]39 


E,A 




XlAA 
A«H/ 


L,F 




XlAl 


RQ 




X149 
A«W 


PsK 


R 


Xi 40 


RK 


R 


^^144 


F,A 




X145 


G,V 






A.D.E 




^^147 


T,P 






A,V,S 




Xj49 


T.S.L 




X150 


E,G,P,Q 




X15I 


L,E,R 




X152 


R.L 


R 


X 15 3 


G, D 
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V # 


Cnliimn T 


fnlunin TT 


\ (based on a NTLRRCT 


(Rl, R2, R3) 


(R2+R3 only) 


domain) 






^154 


Q,H,A 




^155 


Q,R 




^156 


K,R 




^157 


L,A,R 




^158 


R.A 


R 


X 159 


V,A,E 




^160 


EAN 




^161 


F,L 


F 


^162 


RQ 




^163 


NAG 





Variant polypeptides include those wherein conservative substitutions have 
been introduced by modification of polynucleotides encoding polypeptides of the 
invention. Amino acids can be classified according to physical properties and 
15 contribution to secondary and tertiary protein structure. A conservative substitution is 
recognized in the art as a substitution of one amino acid for another amino acid that 
has similar properties. Exemplary conservative substitutions are set out in Table 2 
(from WO 97/09433, page 10, published March 13, 1997 (PCT/GB96/02197, filed 
9/6/96), immediately below. 



20 



Table 2 
Conservative Substitutions I 
SIDE CHAIN 

CHARACTERISTIC AMINO ACID 

Aliphatic 

Non-polar GAP 

ILV 
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Polar - uncharged C S T M 

NQ 

Polar - charged D E 

KR 

Aromatic HFWY 

Other NQDE 
5 Alternatively, conservative amino acids can be grouped as described in 

Lehninger, [BIOCHEMISTRY, Second Edition; Worth Publishers, Inc. NY, NY 

(1975), pp.71-77] as set out in Table 3, immediately below. 

Table 3 

Conservative Substitutions n 

10 

SIDE CHAIN 



CHARACTERISTIC 


AMINO ACID 


Non-polar (hydrophobic) 




A Aliphatic: 


ALIVP 


B. Aromatic: 


FW 


C. Sulmr-containing: 


M 


D. Boderline: 


G 


Uncharged-polar 




A Hydroxyl: 


STY 


B. Amides: 


NQ 


C. Sylfhydryl: 


C 


D. Boderline: 


G 


Positively Charged (Basic): 


KRH 


Negatively Charged (Acidic): 


DE 



As still another alternative, exemplary conservative substitutions are set out in 
25 Table 4, below. 



Table 4 

Conservative Substitutions IDE 

Original Exemplary Substitution 
Residue 

Ala (A) Val,Leu,Ile 

30 Arg (R) Lys, Gin, Asn 

Asn (N) Gin, His, Lys, Arg 

Asp (D) Glu 

Cys (C) Ser 

Gin (Q) Asn 

35 Glu(E) Asp 
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His (H) 


Asn, Gin, Lys, Arg 


lie (I) 


Leu, Val, Met, Ala, Phe, 


Leu(L) 


He, Val, Met, Ala, Phe 


Lys(K) 


Arg, Gin, Asn 


Met(M) 


Leu, Phe, He 


Phe(F) 


Leu, Val, He, Ala 


Pro(P) 


Gly 


Ser(S) 


Thr 


Thr(T) 


Ser 


Trp(W) 


Tyr 


Tyr(Y) 


Trp, Phe, Thr, Ser 


Val(V) 


He, Leu, Met, Phe, Ala 



In addition, amino acid residues that are conserved among family members of 
1 5 the NgR proteins of the present invention, as indicated by the alignment presented 
herein, are also predicted to be particularly unamenable to alteration. For example, 
NgR proteins of the present invention can contain at least one domain that is a typically 
conserved region in NgRs. Examples of these conserved domains include, e.g., 
leucine-rich repeat domain. Amino acid residues that are not conserved or are only 
20 semi-conserved among members of the NgR proteins may be readily amenable to 
alteration. 

Full-length NgRs have an LRR region characterized by the amino acid 
consensus sequence shown in SEQ ID NO: 19. At least some full-length NgRs also 
include a CT signaling (CTS) domain and a GPI domain. 
25 The NgR domain designations used herein are defined as follows: 



Domain 


hNgRl 
SEQ ID: 5 


mNgRl 
SEQ ID 
NO: 17 


hNgR2 
SEQ ID: 2 


hNgR3 
SEQ ID: 14 


mNgR3 
SEQ ID: 4 


Signal Seq. 


1-26 


1-26 


1-30 




1-40 


LRRNT 


27-56 


27-56 


31-59 




41-69 


LRR1 


57-81 


57-81 


60-82 


5-27 


70-92 ' 


LRR2 


82-105 


82-105 


83-106 


28-51 


93-106 


LRR3 


106-130 


106-130 


107-131 


52-76 


106-141 


LRR.4 


131-154 


131-154 


132-155 


77-100 


142-165 
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LRR5 


155-178 


155-178 


156-179 


101-124 


166-189 


LRR6 


179-202 


179-202 


180-203 


125-148 


190-213 


LRR7 


203-226 


203-226 


204-227 


149-172 


214-237 


T DD 0 


22 /-250 


OOT OCA 

227—250 


OOQ OCT 

22o— ZD I 




2 Jo— 2ol 




20U— JUy 


20U— 


201— J 1U 


ZUO— ZD J 


oti ion 
2 / 1— J2U 


CTS 
(CT 

Signaling) 


310-445 


310-445 


311-395 


256-396 


321-438 


GPI 


446-473 


456^73 


396-420 


370-392 


439^62 



10 

In some embodiments of the invention, the above domains are modified. 
Modification can be in a manner that preserves domain functionality. Modification can 
include addition, deletion or substitution of certain amino acids. Exemplary 
modifications include conservative amino acid substitutions. Preferably such 

15 substitutions number 20 or fewer per 100 residues. More preferably, such 

substitutions number 10 or fewer per 100 residues. Further exemplary modifications 
include addition of flanking sequences of up to five amino acids at the N terminus 
and/or C terminus of one or more of the domains. 

In some embodiments, the isolated nucleic acid molecule encodes a polypeptide 

20 at least about 70%, 80%, 90%, 95%, 98%, and most preferably at least about 99% 
homologous to SEQ ED NO:2, 4 or 14. 

Mutations can be introduced into SEQ ID NOS: 1, 3 or 13 by standard 
techniques, e.g., site-directed mutagenesis and PCR-mediated mutagenesis. 
Conservative amino acid substitutions can be made at one or more amino acid residues 

25 predicted to be non-essential. Alternatively, mutations can be introduced randomly 
along a NgR coding sequence. This can be accomplished, e.g., by saturation 
mutagenesis. The resulting mutants can be screened for NgR biological activity. 
Biological activities of NgR may include but are not limited to: (1) proteimprotein 
interactions, e.g., with other NgRs or other cell-surface proteins involved in Nogo- 

30 related signaling; (2) complex formation with a NgR ligand; (3) binding to an anti- 
NgR antibody. 
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It should be understood that the definition of polypeptides of the invention is 
intended to include polypeptides bearing modifications other than insertion, deletion, 
or substitution of amino acid residues. By way of example, the modifications may be 
covalent in nature, and include for example, chemical bonding with polymers, lipids, 
5 other organic and inorganic moieties. Such derivatives may be prepared to increase 
circulating half-life of a polypeptide, or may be designed to improve the targeting 
capacity of the polypeptide for desired cells, tissues or organs. Similarly, the invention 
further embraces NgR polypeptides that have been covalently modified to include one 
or more water-soluble polymer attachments such as polyethylene glycol, 

10 polyoxyethylene glycol or polypropylene glycol. Variants that display ligand binding 
properties of native NgR and are expressed at higher levels, as well as variants that 
provide for constitutively active receptors, are particularly useful in assays of the 
invention; the variants are also useful in providing cellular, tissue and animal models of 
diseases/conditions characterized by aberrant NgR activity. 

1 5 Chemically modified NgR polypeptide compositions in which the NgR 

polypeptide is linked to a polymer are included within the scope of the present 
invention. The polymer may be water soluble to prevent precipitation of the protein in 
an aqueous environment, such as a physiological environment. Suitable water-soluble 
polymers may be selected from the group consisting of, for example, polyethylene 

20 glycol (PEG), monomethoxypolyethylene glycol, dextran, cellulose, or other 
carbohydrate based polymers, poly-(N-vinyl pyrrolidone) polyethylene glycol, 
polypropylene glycol homopolymers, a polypropylene oxide/ethylene oxide copolymer 
polyoxyethylated polyols (e.g. glycerol) and polyvinyl alcohol. The selected polymer is 
usually modified to have a single reactive group, such as an active ester for acylation or 

25 an aldehyde for alkylation, so that the degree of polymerization may be controlled. 
Polymers may be of any molecular weight, and may be branched or unbranched, and 
mixtures of such polymers may also be used. When the chemically modified NgR 
polymer is destined for therapeutic use, pharmaceutically acceptable polymers will be 
selected for use. 

30 When the polymer is to be modified by an acylation reaction, the polymer 

should have a single reactive ester group. Alternatively, if the polymer is to be 
modified by reductive alkylation, the polymer should have a single reactive aldehyde 
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group. A preferred reactive aldehyde is polyethylene glycol propionaldehyde, which is 
water stable, or mono Cl-CIO alkoxy or aryloxy derivatives thereof (see U.S. Patent 
No. 5,252,714, incorporated by reference herein in its entirety). 

Pegylation of NgR polypeptides may be carried out by any of the pegylation 
5 reactions known in the art, as described, for example, in the following references: 
Focus on Growth Factors 3, 4-10 (1992); EP 0 154 316; and EP 0 401 384 (each of 
which is incorporated by reference herein in its entirety). Preferably, the pegylation is 
carried out via an acylation reaction or an alkylation reaction with a reactive 
polyethylene glycol molecule (or an analogous reactive water-soluble polymer). A 

1 0 preferred water-soluble polymer for pegylation of polypeptides such as NgR is 
polyethylene glycol (PEG). As used herein, "polyethylene glycol" is meant to 
encompass any of the forms of PEG that have been used to derivatize other proteins, 
such as mono (Cl-CIO) alkoxy- or aryloxy-polyethylene glycol. 

Chemical derivatization of NgR polypeptides may be performed under any 

1 5 suitable conditions used to react a biologically active substance with an activated 

polymer molecule. Methods for preparing pegylated NgR polypeptides will generally 
comprise the steps of (a) reacting the polypeptide with polyethylene glycol, such as a 
reactive ester or aldehyde derivative of PEG, under conditions whereby NgR 
polypeptide becomes attached to one or more PEG groups, and (b) obtaining the 

20 reaction products. It will be apparent to one of ordinary skill in the art to select the 
optimal reaction conditions or the acylation reactions based on known parameters and 
the desired result. 

Pegylated and other polymenNgR polypeptides may generally be used to treat 
conditions that may be alleviated or modulated by administration of the NgR 

25 polypeptides described herein. However, the chemically-derivatized polymer:NgR 
polypeptide molecules disclosed herein may have additional activities, enhanced or 
reduced biological activity, or other characteristics, such as increased or decreased 
half-life, as compared to the nonderivatized molecules. The NgR polypeptides, 
fragments thereof, variants and derivatives, may be employed alone, together, or in 

30 combination with other pharmaceutical compositions. The cytokines, growth factors, 
antibiotics, antiinflammatories and/or chemotherapeutic agents as is appropriate for the 
indication being treated. 
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The present invention provides compositions comprising purified polypeptides 
of the invention. Preferred compositions comprise, in addition to the polypeptide of 
the invention, a pharmaceutical^ acceptable (i.e., sterile and non-toxic) liquid, 
semisolid, or solid diluent that serves as a pharmaceutical vehicle, excipient or medium, 
5 Any diluent known in the art may be used. Exemplary diluents include, but are not 
limited to, water, saline solutions, polyoxyethylene sorbitan monolaurate, magnesium 
stearate, methyl- and propylhydroxybenzoate, talc, alginates, starches, lactose, sucrose, 
dextrose, sorbitol, mannitol, glycerol, calcium phosphate, mineral oil and cocoa butter. 
Variants that display ligand binding properties of native NgR and are expressed 

10 at higher levels, as well as variants that provide for constitutively active receptors, are 
particularly useful in assays of the invention; the variants are also useful in assays of the 
invention and in providing cellular, tissue and animal models of diseases/conditions 
characterized by aberrant NgR activity. 

With the knowledge of the nucleotide sequence information disclosed in the 

1 5 present invention, one skilled in the art can identify and obtain nucleotide sequences 
which encode NgR from different sources (/.<?., different tissues or different organisms) 
through a variety of means well known to the skilled artisan and as disclosed by, for 
example, Sambrook et al, Molecular Cloning: A Laboratory Manual, Second 
Edition, Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), which is 

20 incorporated herein by reference in its entirety. 

For example, DNA that encodes NgR may be obtained by screening of mRNA, 
cDNA, or genomic DNA with oligonucleotide probes generated from the NgR gene 
sequence information provided herein. Probes may be labeled with a detectable group, 
such as a fluorescent group, a radioactive atom or a chemiluminescent group in 

25 accordance with procedures known to the skilled artisan and used in conventional 
hybridization assays, as described by, for example, Sambrook et al (1989) above. 

A nucleic acid molecule comprising any of the NgR nucleotide sequences 
described above can alternatively be synthesized by use of the polymerase chain 
reaction (PCR) procedure, with the PCR oligonucleotide primers produced from the 

30 nucleotide sequences provided herein. See U.S. Patent Nos. 4,683, 195 to Mullis et al 
and 4,683,202 to Mullis. The PCR reaction provides a method for selectively 
increasing the concentration of a particular nucleic acid sequence even when that 
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sequence has not been previously purified and is present only in a single copy in a 
particular sample. The method can be used to amplify either single- or 
double-stranded DNA The essence of the method involves the use of two 
oligonucleotide probes to serve as primers for the template-dependent, polymerase- 
5 mediated replication of a desired nucleic acid molecule. 

A wide variety of alternative cloning and in vitro amplification methodologies 
are well known to those skilled in the art. Examples of these techniques are found in, 
for example, Berger et al , Guide to Molecular Cloning Techniques, METHODS IN 
ENZYMOLOGY 152 Academic Press, San Diego, CA, which is incorporated herein by 
10 reference in its entirety. 

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with certain disorders, as well as for genetic mapping. 

15 Antibodies 

Also comprehended by the present invention are antibodies (e.g., monoclonal 
and polyclonal antibodies, single chain antibodies, chimeric antibodies, 
bifimctional/bispecific antibodies, humanized antibodies, human antibodies, and 
complementary determining region (CDR)-grafted antibodies, including compounds 

20 which include CDR sequences which specifically recognize a polypeptide of the 

invention) specific for NgR or fragments thereof. Preferred antibodies of the invention 
are human antibodies which are produced and identified according to methods 
described in W093/1 1236, published June 20, 1993, which is incorporated herein by 
reference in its entirety. Antibody fragments, including Fab, Fab', F(ab')2> and F v , are 

25 also provided by the invention. The term "specific for," when used to describe 

antibodies of the invention, indicates that the variable regions of the antibodies of the 
invention recognize and bind NgR polypeptides exclusively (£&, are able to distinguish 
NgR polypeptides from other known NgR polypeptides by virtue of measurable 
differences in binding affinity, despite the possible existence of localized sequence 

30 identity, homology, or similarity between NgR and such polypeptides). 

The antigenic peptide of NgR comprises at least 8 amino acid residues of the 
amino acid sequence shown in SEQ ID NO:2, 4 or 14 and encompasses an epitope of 
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NgR such that an antibody raised against the peptide forms a specific immune complex 
with NgR. Preferably, the antigenic peptide comprises at least 10 amino acid residues, 
more preferably at least 15 amino acid residues, even more preferably at least 20 amino 
acid residues, and most preferably at least 30 amino acid residues. Preferred epitopes 
5 encompassed by the antigenic peptide are regions of Ngjl that are located on the 
surface of the protein, e.g., hydrophilic regions. 

It will be understood that specific antibodies may also interact with other 
proteins (for example, S. aureus protein A or other antibodies in ELIS A techniques) 
through interactions with sequences outside the variable region of the antibodies, and, 

1 0 in particular, in the constant region of the molecule. Screening assays to determine 
binding specificity of an antibody of the invention are well known and routinely 
practiced in the art. For a comprehensive discussion of such assays, see Harlow et al. 
in Antibodies: A Laboratory Manual, Cold Spring Haibor Laboratory Press; Cold 
Spring Harbor, NY (1988), Chapter 6. Antibodies that recognize and bind fragments 

15 of the NgR polypeptides of the invention are also contemplated, provided that the 
antibodies are specific for NgR polypeptides. Antibodies of the invention can be 
produced using any method well known and routinely practiced in the art. 

For the production of polyclonal antibodies, various suitable host animals {e.g., 
rabbit, goat, mouse or other mammal) may be immunized by injection with the native 

20 protein, or a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, recombinantly expressed NgR 
protein or a chemically synthesized NgR polypeptide. The preparation can further 
include an adjuvant. Various adjuvants used to increase the immunological response 
include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., 

25 aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, dinitrophenol, etc.), human adjuvants such as 
Bacille Calmette-Guerin and Corynebacterium parvum or similar immunostimulatory 
agents. If desired, the antibody molecules directed against NgR can be isolated from 
the mammal (e.g., from the blood) and further purified by well known techniques, such 

30 as protein A chromatography to obtain the IgG fraction. 

The term "monoclonal antibody" or "monoclonal antibody composition," as 
used herein, refers to a population of antibody molecules that contain only one species 
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of an antigen binding site capable of immunoreacting with a particular epitope of NgR. 
A monoclonal antibody composition thus typically displays a single binding affinity for 
a particular NgR protein with which it immunoreacts. For preparation of monoclonal 
antibodies directed towards a particular NgR protein, or derivatives, fragments, 
5 analogs or homologs thereof, any technique that provides for the production of 
antibody molecules by continuous cell line culture may be utilized. Such techniques 
include, but are not limited to, the hybridoma technique (see Kohler and Milstein 
(1975) Nature 256, 495-497); the trioma technique; the human B-cell hybridoma 
technique (see Kozbor et al, (1983) Immunol Today 4, 72) and the EBV hybridoma 

10 technique to produce human monoclonal antibodies (see Cole et al, (1985) in 

MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). 
Human monoclonal antibodies may be utilized in the practice of the present invention 
and may be produced by using human hybridomas (see Cote et al, (1983) Proc. Natl 
Acad. Set USA 80, 2026-2030) or by transforming human B-cells with Epstein Barr 

1 5 Virus in vitro (see Cole et al , (1985), above). 

According to the invention, techniques can be adapted for the production of 
single-chain antibodies specific to a NgR protein (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of Fab expression libraries 
(see e.g., Huse et al, (1989) Science 246, 1275-1281) to allow rapid and effective 

20 identification of monoclonal F^ fragments with the desired specificity for a NgR 

protein or derivatives, fragments, analogs or homologs thereof. Non-human antibodies 
can be "humanized" by techniques well known in the art. See e.g., U.S. Patent No. 
5,225,539. In one method, the non-human CDRs are inserted into a human antibody 
or consensus antibody framework sequence. Further changes can then be introduced 

25 into the antibody framework to modulate affinity or immunogenicity. Antibody 

fragments that contain the idiotypes to a NgR protein may be produced by techniques 
known in the art including, but not limited to: (i) an F(ab') 2 fragment produced by 
pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing 
the disulfide bridges of an F(ab') 2 fragment; (iii) an Fab fragment generated by the 

30 treatment of the antibody molecule with papain and a reducing agent and (iv) F v 
fragments. 
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Additionally, recombinant anti-NgR antibodies, such as chimeric and 
humanized monoclonal antibodies, comprising both human and non-human portions, 
which can be made using standard recombinant DNA techniques, are within the scope 
of the invention. Such chimeric and humanized monoclonal antibodies can be 
5 produced by recombinant DNA techniques known in the art, for example using 

methods described in PCT International Application No. PCT/US86/02269; European 
Patent Application No. 184,187; European Patent Application No. 171,496; European 
Patent Application No. 173,494; PCT International Publication No. WO 86/01533; 
U.S. Pat. No. 4,816,567; European Patent Application No. 125,023; Better etal, 

10 (1988) Science 240, 1041-1043; Liu et al, (1987) Proc. Natl Acad Sci. USA 84, 
3439.3443; Liu et al, (1987) 1 Immunol 139, 3521-3526; Sun et al, (1987) Proc. 
Natl Acad Set USA 84, 214-218; Nishimura et al, (1987) Cancer Res. 47, 999-1005; 
Wood etal, (1985) Nature 314, 446-449; Shawefa/,. (1988) J. Natl Cancerlnst. 
80, 1553-1559); Monison (\9%5) Science 229, 1202-1207; Oi etal, (1986) 

15 BioTechniques 4, 214; U.S. Patent. No. 5,225,539; Jones et al, (1986) Nature 321, 
552-525; Verhoeyan et al, (1988) Science 239, 1534; and Beidler et al, (1988) J. 
Immunol 141, 4053-4060. 

In a preferred embodiment of the invention a portion of a NgR is joined to an 
Fc portion of an antibody to form a NgR/Fc fusion protein. Preferably, the Ig fusion 

20 protein is soluble. The NgR/Fc fusion protein may be formed by recombinant 

techniques as described above. In one embodiment, a portion of a NgR including the 
entire amino acid sequence of NgR except the C-terminal hydrophobic region is fused 
to an Fc portion of an antibody. In preferred embodiments, the NgR is a human NgR 
and the Fc is also human. More preferably, the human Fc portion is derived from an 

25 IgG antibody. In other embodiments, the N-terminal signal sequence is omitted. Such 
antibodies are useful in binding Nogo to prevent Nogo signaling through the NgR. 

In one embodiment, methods for the screening of antibodies that possess the 
desired specificity include, but are not limited to, enzyme-linked immunosorbent assay 
(ELISA) and other immunologically-mediated techniques known within the art. In a 

30 specific embodiment, selection of antibodies that are specific to a particular domain of 
a NgR protein is facilitated by generation of hybridomas that bind to the fragment of a 
Ngjl protein possessing such a domain. Antibodies that are specific for one or more 
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domains within a NgR protein, e.g., domains spanning the above-identified conserved 
regions of NgRs, or derivatives, fragments analogs or homologs thereof, are also 
provided herein. 

Anti-NgR antibodies may be used in methods known within the art relating to 
5 the localization and/or quantitation of a NgR protein (e.g. , for use in measuring levels 
of the NgR protein within appropriate physiological samples, for use in diagnostic 
methods, for use in imaging the protein, and the like). In a given embodiment, 
antibodies for NgR proteins, or derivatives, fragments analogs or homologs thereof, 
that contain the antibody derived binding domain, are utilized as pharmacologically- 

10 active compounds [hereinafter "Therapeutics"]. 

An anti-NgR antibody (e.g., monoclonal antibody) can be used to isolate NgR 
by standard techniques, such as affinity chromatography or immunoprecipitation. An 
anti-NgR antibody can facilitate the purification of natural NgR from cells and of 
recombinantly produced NgR expressed in host cells. Moreover, an anti-NgR antibody 

15 can be used to detect NgR protein (e.g., in a cellular lysate or cell supernatant) in order 
to evaluate the abundance and pattern of expression of the NgR protein. Anti-NgR 
antibodies can be used diagnostically to monitor protein levels in tissue as part of a 
clinical testing procedure, e.g., to, for example, determine the efficacy of a given 
treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) 

20 the antibody to a detectable substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials and radioactive materials. Examples of suitable enzymes 
include horseradish peroxidase, alkaline phosphatase, p-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 

25 streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a 
luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin and aequorin, and examples of suitable radioactive material include 

30 125 I, 131 I, 35 Sor 3 H. 

Another aspect of the present invention is directed to methods of inducing an 
immune response in a mammal against a polypeptide of the invention by administering 
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to the mammal an amount of the polypeptide sufficient to induce an immune response. 
The amount will be dependent on the animal species, size of the animal, and the like 
but can be determined by those skilled in the art. 

Another aspect of the invention is directed to anti-idiotypic antibodies and 
5 anti-anti-idiotypic antibodies. An anti-idiotypic antibody is an antibody that recognizes 
determinants of another antibody (a target antibody). Generally, the anti-idiotypic 
antibody recognizes determinants of the antigen-binding site of the target antibody. 
Typically, the target antibody is a monoclonal antibody. An anti-idiotypic antibody is 
generally prepared by immunizing an animal (particularly, mice) of the same species 

10 and genetic type as the source of the target monoclonal antibody, with the target 
monoclonal antibody. The immunized animal mounts an immune response to the 
idiotypic determinants of the target monoclonal antibody and produces antibodies 
against the idiotypic determinants of the target monoclonal antibody. 
Antibody-producing cells, such as splenic cells, of the immunized animal may be used 

15 to generate anti-idiotypic monoclonal antibodies. Furthermore, an anti-idiotypic 
antibody may also be used to immunize animals to produce anti-anti-idiotypic 
antibodies. These immunized animals may be used to generate anti-anti-idiotypic 
monoclonal antibodies using standard techniques. The anti-anti-idiotypic antibodies 
may bind to the same epitope as the original, target monoclonal antibody used to 

20 prepare the anti-idiotypic antibody. The anti-anti-idiotypic antibodies represent other 
monoclonal antibodies with the same antigen specificity as the original target 
monoclonal antibody. 

If the binding of the anti-idiotypic antibody with the target antibody is inhibited 
by the relevant antigen of the target antibody, and if the anti-idiotypic antibody induces 

25 an antibody response with the same specificity as the target antibody, it mimics the 
antigen of the target antibody. Such an anti-idiotypic antibody is an "internal image 
antiidiotype 1 ' and is capable of inducing an antibody response as if it were the original 
antigen. (Bona and Kohler (1984) Anti-idiotypic antibodies and internal image, 
in Monoclonal and anti-idiotypic antibodies: Probes for receptor structure 

30 AND FUNCTION, Venter J.C. et al (Eds), Alan R. Liss, New York, NY, pp 141-149, 
1984). Vaccines incorporating internal image anti-idiotype antibodies have been 
shown to induce protective responses against viruses, bacteria, and parasites (Kennedy 
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etal t (1986)232, 220-223; 1047; McNamara (1985) Science 226, 1325-1326). 
Internal image anti-idiotypic antibodies have also been shown to induce immunity to 
tumor related antigens (Raychauhuri etaL, (1986) J. Immunol 137, 1743-1749; 
Raychauhuriefa/., (1987)7. Immunol 139, 3902-3910; Bhattacharya-Chatterjeee* 
5 al, (1987) J. Immunol 139, 1354-1360; Bhattacharya-Chatterjee et al, (1988) J. 
Immunol 141, 1398-1403; Herlyn. etal (1989) Intern. Rev. Immunol 4, 347-357; 
Chen et al (1990) Celllmm. Immunother. Cancer 351-359; Herlyn et al, (1991) in 
vivo 5, 615-624; Furuya etal. (1992) Anticancer Res. 12, 27-32; Mittelman, A. etal 
(1992) Proc. Natl Acad. ScL, USA 89, 466-470; Durrant. etal, (1994) Cancer Res. 
10 54, 4837-4840; Mittelman. etal (1994) Cancer Res. 54, 415-421; Schmitt. etal 

(1994) Hybridoma 13, 389-396; Chakrobarty. etal (1995)7. Immunother. 18, 
95-103; Chakrobarty. etal. (1995) Cancer Res. 55, 1525-1530; Foon, K. A. etal 

(1995) Clin. Cancer Res. 1, 1205-1294; Herlyn et al (1995) Hybridoma 14, 159-166; 
Sclebuschef a/. (1995) Hybridoma 14, 167-174; Herlyn. etal (1996) Cancer 

15 Immunol Immunother. 43, 65-76). 

Anti-idiotypic antibodies for NgR may be prepared, for example, by 
immunizing an animal, such as a mouse, with a immunogenic amount of a composition 
comprising NgR2 (SEQ ID NO:2), NgR3 (SEQ ID NOs:4 or 14), or immunogenic 
portion thereof, containing at least one antigenic epitope of NgR. The composition 

20 may also contain a suitable adjuvant, and any carrier necessary to provide 

immunogenicity. Monoclonal antibodies recognizing NgR may be prepared from the 
cells of the immunized animal as described above. A monoclonal antibody recognizing 
an epitope of NgR is then selected and used to prepare a composition comprising an 
immunogenic amount of the anti-NgR monoclonal antibody. Typically, a 25 to 200 |ig 

25 dose of purified anti-NgR monoclonal would be sufficient in a suitable adjuvant. 

Animals may be immunized 2-6 times at 14 to 30 day intervals between doses. 
Typically, animals are immunized by any suitable route of administration, such as 
intraperitoneal, subcutaneous, intravenous or a combination of these. Anti-idiotypic 
antibody production may be monitored during the immunization period using standard 

30 immunoassay methods. Animals with suitable titers of antibodies reactive with the 

target monoclonal antibodies may be reimmunized with the monoclonal antibody used 
as the immunogen three days before harvesting the antibody producing cells. 
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Preferably, spleen cells are used, although other antibody producing cells may be 
selected. Antibody-producing cells are harvested and fused with myeloma cells to 
produce Hybridomas, as described above, and suitable anti-idiotypic 
antibody-producing cells are selected. 
5 Anti-anti-idiotypic antibodies are produced by another round of immunization 

and Hybridoma production by using the anti-idiotypic monoclonal antibody as the 
immunogen. 

Antibodies of the invention are useful for, e.g., therapeutic purposes (by 
modulating activity of NgR), diagnostic purposes to detect or quantitate NgR, and 
10 purification of NgR. Therefore, kits comprising an antibody of the invention for any of 
the purposes described herein are also comprehended. 

Kits 

The present invention is also directed to kits, including pharmaceutical kits. 

1 5 The kits can comprise any of the nucleic acid molecules described above, any of the 
polypeptides described above, or any antibody which binds to a polypeptide of the 
invention as described above, as well appropriate controls, such as positive and/or 
negative controls. The kit preferably comprises additional components, such as, for 
example, instructions, solid support, reagents helpful for quantification, and the like. 

20 For example, the kit can comprise: a labeled compound or agent capable of detecting 
NgR protein or mRNA in a biological sample; means for determining the amount of 
NgR in the sample; and means for comparing the amount of NgR in the sample with a 
standard. The compound or agent can be packaged in a suitable container. 

25 Screening Assays 

The DNA and amino acid sequence information provided by the present 
invention also makes possible identification of binding partner compounds with which 
a NgR polypeptide or polynucleotide will interact. Methods to identify binding partner 
compounds include solution assays, in vitro assays wherein NgR polypeptides are 

30 immobilized and cell-based assays. Identification of binding partner compounds of 
NgR polypeptides provides candidates for therapeutic or prophylactic intervention in 
pathologies associated with NgR normal and aberrant biological activity. 
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The invention also provides a method (also referred to herein as a "screening 
assay") for identifying modulators, le. % candidate or test compounds or agents {e.g., 
peptides, peptidomimetics, small molecules {e.g., molecules of less than 1,000 Daltons) 
or other drugs) that bind to NgR proteins or have a stimulatory- or inhibitory effect on, 
5 for example, NgR expression or NgR activity. 

In one embodiment, the invention provides assays for screening candidate or 
test compounds which bind to or modulate the activity of a NgR protein or 
polypeptide or biologically active portion thereof. The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 

10 library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 

15 non-peptide oligomer or small molecule libraries of compounds (Lam (1997) 
Anticancer Drug Des. 12, 145). 

Examples of methods for the synthesis of molecular libraries can be found in 
the art, for example in: DeWitt et al 9 (1993) Proc. Natl Acad Set USA 90, 6909; Eih 
etal, (1994) Proc. Natl Acad Set USA 91,11422; Zuckermann etal (1994) J. Med 

20 Chem 37, 2678; Cho et al t (1993) Science 261, 1303; Carrell et al t (1994) Angew 
Chem. Int. Ed Engl. 33, 2059; Carell etal, (1994) Angew Chem. Int. Ed. Engl 33, 
2061; and Gallop etal, (1994) J. Med Chem 37, 1233. 

Libraries of compounds may be presented in solution {e.g., Houghten (1992) 
BioTechniques 13, 412-421), or on beads (Lam (1991) Nature 354, 82-84), on chips 

25 (Fodor (1993) Nature 364, 555-556), bacteria (Ladner, U.S. Patent No. 5,223,409), 
spores (Ladner, above), plasmids (Cull etal (1992) Proc. Natl Acad. Set USA 89, 
1865-1869) or on phage (Scott and Smith (1990) Science 249, 386-390; Devlin (1990) 
Science 249, 404-406; Cwirla etal (1990) Proc. Natl Acad Set USA 87, 6378-6382; 
Felici (1991) J. Mol Biol 222, 301-310; Ladner, above). 



30 
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1. Cell-based Assays 

The invention also provides cell-based assays to identify binding partner 
compounds of a NgR polypeptide. In one embodiment, the invention provides a 
method comprising the steps of contacting a NgR polypeptide expressed on the surface 

5 of a cell with a candidate binding partner compound and detecting binding of the 

candidate binding partner compound to the NgR polypeptide. In another embodiment, 
an assay is a cell-based assay comprising contacting a cell expressing a membrane- 
bound form of NgR protein, or a biologically active portion thereof, on the cell surface 
with a test compound and determining the ability of the test compound to modulate 

1 0 {e.g. , stimulate or inhibit) the activity of the NgR protein or biologically active portion 
thereof. 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses a membrane-bound form of NgR protein, or a biologically active portion 
thereof, on the cell surface is contacted with a test compound and the ability of the test 

1 5 compound to bind to a NgR protein determined. The cell, for example, can be of 

mammalian origin or a yeast cell. Determining the ability of the test compound to bind 
to the NgR protein can be accomplished, for example, by coupling the test compound 
with a radioisotope or en2ymatic label such that binding of the test compound to the 
NgR protein or biologically active portion thereof can be determined by detecting the 

20 labeled compound in a complex. For example, test compounds can be labeled with 
125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by 
direct counting of radioemission or by scintillation counting. Alternatively, test 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
alkaline phosphatase or luciferase, and the enzymatic label detected by determination 

25 of conversion of an appropriate substrate to product. In one embodiment, the assay 
comprises contacting a cell which expresses a membrane-bound form of NgR protein 
or a biologically active portion thereof, on the cell surface with a known compound 
which binds NgR to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with a NgR 

30 protein, wherein determining the ability of the test compound to interact with a NgR 
protein comprises determining the ability of the test compound to preferentially bind to 
NgR or a biologically active portion thereof as compared to the known compound. 
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Determining the ability of the test compound to modulate the activity of NgR 
or a biologically active portion thereof can be accomplished, for example, by 
determining the ability of the NgR protein to bind to or interact with a NgR target 
molecule. As used herein, a "target molecule" is a molecule with which a NgR protein 
5 binds or interacts in nature, for example, a molecule on the surface of a cell which 
expresses a NgR protein, a molecule on the surface of a second cell, a molecule in the 
extracellular milieu, a molecule associated with the internal surface of a cell membrane 
or a cytoplasmic molecule. A NgR target molecule can be a non-NgR molecule or a 
NgR protein or polypeptide of the present invention. In one embodiment, a NgR 

1 0 target molecule is a component of a signal transduction pathway that facilitates 
transduction of an extracellular signal (e.g., a signal generated by binding of a 
compound to a membrane-bound NgR molecule) through the cell membrane and into 
the cell. The target, for example, can be a second intercellular protein that has 
catalytic activity or a protein that facilitates the association of downstream signaling 

15 molecules with NgR. In a preferred embodiment, the detection comprises detecting a 
calcium flux or other physiological event in the cell caused by the binding of the 
molecule. 

Specific binding molecules, including natural ligands and synthetic compounds, 
can be identified or developed using isolated or recombinant NgR products, NgR 
20 variants, or preferably, cells expressing such products. Binding partners are useful for 
purifying NgR products and detection or quantification of NgR products in fluid and 
tissue samples using known immunological procedures. Binding molecules are also 
manifestly useful in modulating (i.e., blocking, inhibiting or stimulating) biological 
activities of NgR, especially those activities involved in signal transduction. 

25 

2. Cell-free Assays 

(a) Direct binding: 

The invention includes several assay systems for identifying NgR binding 
partners. In solution assays, methods of the invention comprise the steps of (a) 
30 contacting a NgR polypeptide with one or more candidate binding partner compounds 
and (b) identifying the compounds that bind to the NgR polypeptide. Identification of 
the compounds that bind the NgR polypeptide can be achieved by isolating the NgR 



WO 02/29059 



PCT/US01/31488 



-73- 

polypeptide/binding partner complex and separating the binding partner compound 
from the NgR polypeptide. An additional step of characterizing the physical, biological 
and/or biochemical properties of the binding partner compound is also comprehended 
in another embodiment of the invention. In one aspect, the NgR polypeptide/binding 
5 partner complex is isolated using an antibody immunospecific for either the NgR 
polypeptide or the candidate binding partner compound. 

In still other embodiments, either the NgR polypeptide or the candidate binding 
partner compound comprises a label or tag that facilitates its isolation, and methods of 
the invention to identify binding partner compounds include a step of isolating the NgR 
1 0 polypeptide/binding partner complex through interaction with the label or tag. An 
exemplary tag of this type is a poly-histidine sequence, generally around six histidine 
residues, that permits isolation of a compound so labeled using nickel chelation. Other 
labels and tags, such as the FLAG® tag (Eastman Kodak, Rochester, NY), well known 
and routinely used in the art, are embraced by the invention. 

15 

(b) Immobilized NgR 

In one variation of an in vitro assay, the invention provides a method 
comprising the steps of (a) contacting an immobilized NgR polypeptide, or a 
biologically active fragment thereof with a candidate binding partner compound and (b) 

20 detecting binding of the candidate compound to the Ngjl polypeptide. In an 

alternative embodiment, the candidate binding partner compound is immobilized and 
i binding of NgR is detected. Immobilization is accomplished using any of the methods 
well known in the art, including covalent bonding to a support, a bead or a 
chromatographic resin, as well as non-covalent, high affinity interactions such as 

25 antibody binding, or use of streptavidin/biotin binding wherein the immobilized 
compound includes a biotin moiety. Binding of a test compound to NgR, or 
interaction of NgR with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtiter plates, test tubes, and micro-centrifuge 

30 tubes. In one embodiment, a fusion protein can be provided that adds a domain that 
allows one or both of the proteins to be bound to a matrix. For example, and not by 
way of limitation, GST-NgR fusion proteins or GST-target fusion proteins can be 



WO 02/29059 



PCT/US01/31488 



-74- 

adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or 
glutathione derivatized microtiter plates, that are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or NgR 
protein, and the mixture is incubated under conditions conducive to complex formation 
5 (e.g., at physiological conditions for salt and pH). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound components, the matrix 
immobilized in the case of beads, and the complexes determined either directly or 
indirectly, for example, as described above. Alternatively, the complexes can be 
dissociated from the matrix, and the level of NgR binding or activity determined using 

10 standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either NgR or its target molecule can 
be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated NgR or 
target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) using 

15 techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). Alternatively, antibodies reactive with NgR or target molecules, but which 
do not interfere with binding of the NgR protein to its target molecule, can be 
derivatized to the wells of the plate, and unbound target or NgR trapped in the wells 

20 by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the NgR or target molecule, as well as 
enzyme-linked assays that rely on detecting an enzymatic activity associated with the 
NgR or target molecule. 

25 Detection of binding can be accomplished (i) using a radioactive label on the 

compound that is not immobilized, (ii) using of a fluorescent label on the 
non-immobilized compound, (iii) using an antibody immunospecific for the 
non-immobilized compound, (iv) using a label on the non-immobilized compound that 
excites a fluorescent support to which the immobilized compound is attached, (v) 

30 determining the activity of the NgR, as well as other techniques well known and 
routinely practiced in the art. 
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Determining the activity of the target molecule, for example, may be 
accomplished by detecting induction of a cellular second messenger of the target {i.e. 

intracellular Ca , diacylglycerol, DP 3 , etc.), detecting catalytic/enzymatic activity of the 

i 

target an appropriate substrate, detecting the induction of a reporter gene (comprising 
5 a NgR-responsive regulatory element operatively linked to a nucleic acid encoding a 
detectable marker, e.g., luciferase), or detecting a cellular response, for example, cell 
survival, cellular differentiation, or cell proliferation. 

(c) Competition experiments 

10 In yet another embodiment, the assay comprises contacting the NgR protein or 

biologically active portion thereof with a known compound which binds NgR to form 
an assay mixture, contacting the assay mixture with a test compound, and determining 
the ability of the test compound to interact with a NgR protein, wherein determining 
the ability of the test compound to interact with a NgR protein comprises determining 

15 the ability of the test compound to preferentially bind to NgR or biologically active 
portion thereof as compared to the known compound. 

In yet another embodiment, the cell-free assay comprises contacting the NgR 
protein or biologically active portion thereof with a known compound which binds 
NgR to form an assay mixture, contacting the assay mixture with a test compound, and 

20 determining the ability of the test compound to interact with a NgR protein, wherein 
determining the ability of the test compound to interact with a NgR protein comprises 
determining the ability of the Ngjl protein to modulate the activity of a NgR target 
molecule. 

The cell-free assays of the present invention are amenable to use of both the 
25 soluble form or the membrane-bound form of NgR. In the case of cell-free assays 
comprising the membrane-bound form of NgR, it may be desirable to utilize a 
solubilizing agent such that the membrane-bound form of NgR is maintained in 
solution. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, 
30 decanoyl-N-methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, 

Isotridecypoly(ethylene glycol ether)^ 3-(3-cholamidopropyl)dimethylamminiol-l- 
propane sulfonate (CHAPS), 3-(3-cholamidopropyl)dimethylamminiol-2-hydroxy-l- 
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propane sulfonate (CHAPSO), or N-dodecyl-N,N-dimethyl-3 -ammonio- 1 -propane 
sulfonate. 

Modulators 

5 Agents that modulate (i.e., increase, decrease, or block) NgR activity or 

expression may be identified by incubating a putative modulator with a cell containing 
a NgR polypeptide or polynucleotide and determining the effect of the putative 
modulator on NgR activity or expression. The selectivity of a compound that 
modulates the activity of NgR can be evaluated by comparing its effects on NgR to its 

10 effect on other NgR compounds. Selective modulators may include, for example, 

antibodies and other proteins, peptides or organic molecules which specifically bind to 
a NgR polypeptide or a NgR-encoding nucleic acid. Modulators of NgR activity will 
be therapeutically useful in treatment of diseases and physiological conditions in which 
normal or aberrant NgR activity is involved. NgR polynucleotides, polypeptides and 

15 modulators may be used in the treatment of such diseases and conditions associated 
with demyelination. NgR polynucleotides and polypeptides, as well as NgR 
modulators, may also be used in diagnostic assays for such diseases or conditions. 

Methods of the invention to identify modulators include variations on any of 
the methods described above to identify binding partner compounds, the variations 

20 including techniques wherein a binding partner compound has been identified and the 
binding assay is carried out in the presence and absence of a candidate modulator. A 
modulator is identified in those instances where binding between the NgR polypeptide 
and the binding partner compound changes in the presence of the candidate modulator 
compared to binding in the absence of the candidate modulator compound. A 

25 modulator that increases binding between the NgR polypeptide and the binding partner 
compound is described as an enhancer or activator, and a modulator that decreases 
binding between the NgR polypeptide and the binding partner compound is described 
as an inhibitor. 

In another embodiment, modulators of NgR expression may be identified in a 
30 method wherein a cell is contacted with a candidate compound and the expression of 
NgR mRNA or protein in the cell is determined. The level of expression of NgR 
mRNA or protein in the presence of the candidate compound is compared to the level 
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of expression of NgR mRNA or protein in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of NgR expression 
based on this comparison. For example, when expression of NgR mRNA or protein is 
greater (statistically significantly greater) in the presence of the candidate compound 
5 than in its absence, the candidate compound is identified as a stimulator of NgR mRNA 
or protein expression. Alternatively, when expression of NgR mRNA or protein is less 
(statistically significantly less) in the presence of the candidate compound than in its 
absence, the candidate compound is identified as an inhibitor of NgR mRNA or protein 
expression. The level of NgR mRNA or protein expression in the cells can be 
10 determined by methods described herein for detecting NgR mRNA or protein. 

High Throughput Screening 

The invention also comprehends high-throughput screening (HTS) assays to 
identify compounds that interact with or inhibit biological activity (i.e., affect 

15 enzymatic activity, binding activity, etc.) of a NgR polypeptide. HTS assays permit 
screening of large numbers of compounds in an efficient manner. Cell-based HTS 
systems are contemplated to investigate NgR receptor-ligand interaction. HTS assays 
are designed to identify "hits" or "lead compounds" having the desired property, from 
which modifications can be designed to improve the desired property. Chemical 

20 modification of the "hit" or "lead compound" is often based on an identifiable 
structure/activity relationship between the "hit" and the NgR polypeptide. 

Another aspect of the present invention is directed to methods of identifying 
compounds that bind to either NgR or nucleic acid molecules encoding NgR, 
comprising contacting NgR, or a nucleic acid molecule encoding the same, with a 

25 compound, and determining whether the compound binds NgR or a nucleic acid 

molecule encoding the same. Binding can be determined by binding assays which are 
well known to the skilled artisan, including, but not limited to, gel-shift assays, 
Western blots, radiolabeled competition assay, phage-based expression cloning, 
co-fractionation by chromatography, co-precipitation, cross linking, interaction 

30 trap/two-hybrid analysis, southwestern analysis, ELISA, and the like, which are 

described in, for example, Ausubel et al (Eds.), Current Protocols in Molecular 
BIOLOGY, 1999, John Wiley & Sons, NY, which is incorporated herein by reference in 
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its entirety. The NgR proteins, for example, can be used as "bait proteins" in a two- 
hybrid assay or three hybrid assay (see, e.g., U.S. Patent No. 5,283,3 17; Zervos et al, 
(1993) Cell 72, 223-232; Madura et al. 9 (1993) J. Biol. Chem. 268, 12046-12054; 
Bartel etal, (1993) BioTechniques 14, 920-924; Iwabuchi etaL, (1993) Oncogene 8, 
5 1693-1696; and Brent WO 94/10300), to identify other proteins that bind to or interact 
with NgR ('TN[gR-binding proteins" or "NgR-bp") and modulate NgR activity. Such 
NgR-binding proteins are also likely to be involved in the propagation of signals by the 
NgR proteins as, for example, upstream or downstream elements of the NgR pathway. 
Other assays may be used to identify specific ligands of a NgR receptor, 

10 including assays that identify ligands of the target protein through measuring direct 
binding of test ligands to the target protein, as well as assays that identify ligands of 
target proteins through affinity ultrafiltration with ion spray mass spectroscopy/HPLC 
methods or other physical and analytical methods. Alternatively, such binding 
interactions are evaluated indirectly using the yeast two-hybrid system described in 

15 Fields et al, (1989) Nature 340, 245-246, and Fields et al 7 (1994) Trends Genet. 10, 
286-292, both of which are incorporated herein by reference. The two-hybrid system 
is a genetic assay based on the modular nature of most transcription factors used for 
detecting interactions between two proteins or polypeptides. It can be used to identify 
proteins that bind to a known protein of interest, or to delineate domains or residues 

20 critical for an interaction. Variations on this methodology have been developed to 
clone genes that encode DNA binding proteins, to identify peptides that bind to a 
protein, and to screen for drugs. The two-hybrid system exploits the ability of a pair of 
interacting proteins to bring a transcription activation domain into close proximity with 
a DNA binding domain that binds to an upstream activation sequence (UAS) of a 

25 reporter gene, and is generally performed in yeast. The assay requires the construction 
of two hybrid genes encoding (1) a DNA-binding domain that is fused to a first protein 
and (2) an activation domain fused to a second protein. The DNA-binding domain 
targets the first hybrid protein to the UAS of the reporter gene; however, because most 
proteins lack an activation domain, this DNA-binding hybrid protein does not activate 

30 transcription of the reporter gene. The second hybrid protein, which contains the 

activation domain, cannot by itself activate expression of the reporter gene because it 
does not bind the UAS. However, when both hybrid proteins are present, the 
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noncovalent interaction of the first and second proteins tethers the activation domain 
to the UAS, activating transcription of the reporter gene. For example, when the first 
protein is a NgR gene product, or fragment thereof, that is known to interact with 
another protein or nucleic acid, this assay can be used to detect agents that interfere 
5 with the binding interaction. Expression of the reporter gene is monitored as different 
test agents are added to the system. The presence of an inhibitory agent results in lack 
of a reporter signal. The compounds to be screened include (which may include 
compounds that are suspected to bind NgR, or a nucleic acid molecule encoding the 
same), but are not limited to, extracellular, intracellular, biological or chemical origin. 

10 The function of the NgR gene product is unclear and no ligands have yet been 

found which bind the gene product. The yeast two-hybrid assay is useful to identify 
proteins that bind to the gene product. In an assay to identify proteins that bind to a 
NgR receptor, or fragment thereof, a fusion polynucleotide encoding both a NgR 
receptor (or fragment) and a UAS binding domain (/.e., a first protein) may be used. 

15 In addition, a large number of hybrid genes each encoding a different second protein 
fused to an activation domain are produced and screened in the assay. Typically, the 
second protein is encoded by one or more members of a total cDNA or genomic DNA 
fusion library, with each second protein-coding, region being fiised to the activation 
domain. This system is applicable to a wide variety of proteins, and it is not even 

20 necessary to know the identity or function of the second binding protein. The system 
. is highly sensitive and can detect interactions not revealed by other methods; even 
transient interactions may trigger transcription to produce a stable mRNA that can be 
repeatedly translated to yield the reporter protein. 

Other assays may be used to search for agents that bind to the target protein. 

25 One such screening method to identify direct binding of test ligands to a target protein 
is described in U.S. Patent No. 5,585,277, incorporated herein by reference. This 
method relies on the principle that proteins generally exist as a mixture of folded and 
unfolded states, and continually alternate between the two states. When a test ligand 
binds to the folded form of a target protein (i.e., when the test ligand is a ligand of the 

30 target protein), the target protein molecule bound by the ligand remains in its folded 
state. Thus, the folded target protein is present to a greater extent in the presence of a 
test ligand which binds the target protein, than in the absence of a ligand. Binding of 
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the ligand to the target protein can be determined by any method which distinguishes 
between the folded and unfolded states of the target protein. The function of the 
target protein need not be known in order for this assay to be performed. Virtually any 
agent can be assessed by this method as a test ligand, including, but not limited to, 
5 metals, polypeptides, proteins, lipids, polysaccharides, polynucleotides and small 
organic molecules. 

Another method for identifying ligands of a target protein is described in 
Wieboldt et al (1997) Anal Chem. 69:1683-1691, incorporated herein by reference. 
This technique screens combinatorial libraries of 20-30 agents at a time in solution 

10 phase for binding to the target protein. Agents that bind to the target protein are 
separated from other library components by simple membrane washing. The 
specifically selected molecules that are retained on the filter are subsequently liberated 
from the target protein and analyzed by HPLC and pneumatically assisted electrospray 
(ion spray) ionization mass spectroscopy. This procedure selects library components 

1 5 with the greatest affinity for the target protein, and is particularly useful for small 
molecule libraries. 

The methods of the invention also embrace ligands, especially neuropeptides, 
that are attached to a label, such as a radiolabel (e.g., 125 I, 35 S, 32 P, 33 P, 3 H), a 
fluorescence label, a chemiluminescent label, an enzymic label and an immunogenic 

20 label. Modulators falling within the scope of the invention include, but are not limited 
to, non-peptide molecules such as non-peptide mimetics, non-peptide allosteric 
effectors, and peptides. The NgR polypeptide or polynucleotide employed in such a 
test may either be free in solution, attached to a solid support, borne on a cell surface 
or located intracellularly or associated with a portion of a cell. One skilled in the art 

25 can, for example, measure the formation of complexes between NgR and the 
compound being tested. Alternatively, one skilled in the art can examine the 
diminution in complex formation between NgR and its substrate caused by the 
compound being tested. 

Another aspect of the present invention is directed to methods of identifying 

30 compounds which modulate (i.e., increase or decrease) activity of NgR comprising 
contacting NgR with a compound, and determining whether the compound modifies 
activity of NgR. The activity in the presence of the test compared is measured to the 
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activity in the absence of the test compound. Where the activity of the sample 
containing the test compound is higher than the activity in the sample lacking the test 
compound, the compound will have increased activity. Similarly, where the activity of 
the sample containing the test compound is lower than the activity in the sample 
5 lacking the test compound, the compound will have inhibited activity. 

The present invention is particularly useful for screening compounds by using 
NgR in any of a variety of drug screening techniques. The compounds to be screened 
include (which may include compounds which are suspected to modulate NgR 
activity), but are not limited to, extracellular, intracellular, biologic or chemical origin. 

1 0 The NgR polypeptide employed in such a test may be in any form, preferably, free in 
solution, attached to a solid support, borne on a cell surface or located intracellularly. 
One skilled in the art can, for example, measure the formation of complexes between 
NgR and the compound being tested. Alternatively, one skilled in the art can examine 
the diminution in complex formation between Nogo-R and its substrate caused by the 

1 5 compound being tested. 

The activity of NgR polypeptides of the invention can be determined by, for 
example, examining the ability to bind or be activated by chemically synthesized 
peptide ligands. Alternatively, the activity of the NgR can be assayed by examining 
their ability to bind calcium ions, hormones, chemokines, neuropeptides, 

20 neurotransmitters, nucleotides, lipids, odorants and photons. Alternatively, the activity 
of the NgR can be determined by examining the activity of effector molecules 
including, but not limited to, adenylate cyclase, phospholipases and ion channels. 
Thus, modulators of NgR activity may alter a NgR receptor fiinction, such as a binding 
property of a receptor or an activity. In various embodiments of the method, the assay 

25 may take the form of an ion flux assay, a yeast growth assay, a non-hydrolyzable GTP 
assay such as a [ 35 S]-GTP S assay, a cAMP assay, an inositol triphosphate assay, a 
diacylglycerol assay, an Aequorin assay, a Luciferase assay, a FLIPR assay for 
intracellular Ca 2+ concentration, a mitogenesis assay, a MAP Kinase activity assay, an 
arachidonic acid release assay {e.g., using [ 3 H]-arachidonic acid) and an assay for 

30 extracellular acidification rates, as well as other binding or function-based assays of 
NgR activity that are generally known in the art. NgR activity can be determined by 
methodologies that are used to assay for FaRP activity, which is well known to those 
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skilled in the art. Biological activities of NgR receptors according to the invention 
include, but are not limited to, the binding of a natural or an unnatural ligand, as well 
as any one of the functional activities of NgRs known in the art. Non-limiting 
examples of NgR activities include transmembrane signaling of various forms, which 
5 may involve phosphatidylinositol (PI) association and/or the exertion of an influence 
over PI; another exemplary activity of NgRs is the binding of accessory proteins or 
polypeptides that differ from known GPI proteins. 

The modulators of the invention exhibit a variety of chemical structures, which 
can be generally grouped into non-peptide mimetics of natural NgR receptor ligands, 

10 peptide and non-peptide allosteric effectors of NgR receptors, and peptides that may 
function as activators or inhibitors (competitive, uncompetitive and non-competitive) 
(e.g., antibody products) of NgR receptors. The invention does not restrict the 
sources for suitable modulators, which may be obtained from natural sources such as 
plant, animal or mineral extracts, or non-natural sources such as small molecule 

15 libraries, including the products of combinatorial chemical approaches to library 
construction, and peptide libraries. 

Other assays can be used to examine enzymatic activity including, but not 
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are 
described in, for example, Enzyme Assays: A Practical Approach, Eisenthal and 

20 Danson (Eds.), 1992, Oxford University Press, which is incorporated herein by 
reference in its entirety. 

The use of cDNAs in drug discovery programs is well-known; assays capable 
of testing thousands of unknown compounds per day in high-throughput screens 
(HTSs) are thoroughly documented. The literature is replete with examples of the use 

25 of radiolabeled ligands in HTS binding assays for drug discovery (see Williams (1991) 
Med Res. Rev. y 11, 147-184; Sweetnam etal, (1993) 1 Nat Prod 56, 441-455 for 
review). Recombinant receptors are preferred for binding assay HTS because they 
allow for better specificity (higher relative purity), provide the ability to generate large 
amounts of receptor material, and can be used in a broad variety of formats (see 

30 Hodgson (1992) Bio/Technology 10, 973-980; each of which is incorporated herein by 
reference in its entirety). 
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A variety of heterologous systems is available for functional expression of 
recombinant receptors that are well known to those skilled in the art. Such systems 
include bacteria (Strosberg et al (1992) Trends Pharmacol Sci. 13, 95-98), yeast 
(Pausch (1997) Trends BiotechnoL 15, 487-494), several kinds of insect cells (Vanden 
5 Broeck (1996) Int Rev. Cytol 164, 189-268), amphibian cells (Jayawickreme et al 
(1997) Curr. Opin. Biotechnol 8, 629-634) and several mammalian cell lines (CHO, 
HEK293, COS, etc.; see Gerhardt et al (1997) Eur. J. Pharmacol 334, 1-23). These 
examples do not preclude the use of other possible cell expression systems, including 
cell lines obtained from nematodes (PCT application WO 98/37177). 

10 In preferred embodiments of the invention, methods of screening for 

compounds which modulate NgR activity comprise contacting test compounds with 
NgR and assaying for the presence of a complex between the compound and NgR. In 
such assays, the ligand is typically labeled. After suitable incubation, free ligand is 
separated from that present in bound form, and the amount of free or uncomplexed 

1 5 label is a measure of the ability of the particular compound to bind to NgR. 

In another embodiment of the invention, high throughput screening for 
compounds having suitable binding affinity to NgR is employed. Briefly, large 
numbers of different small peptide test compounds are synthesized on a solid substrate. 
The peptide test compounds are contacted with NgR and washed. Bound NgR is then 

20 detected by methods well known in the art. Purified polypeptides of the invention can 
also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the protein 
and immobilize it on the solid support. 

Generally, an expressed Ngjt can be used for HTS binding assays in 

25 conjunction with its defined ligand. The identified peptide is labeled with a suitable 
radioisotope, including, but not limited to, 125 1, 3 H, 35 S or 32 P, by methods that are 
well known to those skilled in the art. Alternatively, the peptides may be labeled by 
well-known methods with a suitable fluorescent derivative (Baindur et al (1994) Drug 
Dev. Res. 33, 373-398; Rogers (1997) DrugDiscov. Today 2, 156-160). Radioactive 

30 ligand specifically bound to the receptor in membrane preparations made from the cell 
line expressing the recombinant protein can be detected in HTS assays in one of 
several standard ways, including filtration of the receptor-ligand complex to separate 
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bound ligand from unbound ligand (Williams (1991) Med Res. Rev. 11, 147-184; 
Sweetnam et al (1993) J. Nat Prod. 56, 441-455). Alternative methods include a 
scintillation proximity assay (SPA) or a FlashPlate format in which such separation is 
unnecessary (Nakayama (1998) Curr. Opin. Drug Disc. Dev. 1, 85-91 Boss6 et al 
5 (1998) J. Biomol Screening 3, 285-292). Binding of fluorescent ligands can be 
detected in various ways, including fluorescence energy transfer (FRET), direct 
spectrophotofluorometric analysis of bound ligand, or fluorescence polarization 
(Rogers (1997) DrugDiscov. Today 2, 156-160; Mil (1998) Curr. Opin. Drug Disc. 
Dev. 1, 92-97). 

10 Examples of such biological responses include, but are not limited to, the 

following: the ability to survive in the absence of a limiting nutrient in specifically 
engineered yeast cells (Pausch (1997) Trends in Biotechnol 15, 487-494); changes in 
intracellular Ca 2+ concentration as measured by fluorescent dyes (Murphy et al (1998) 
Cur. Opin. Drug Disc. Dev. 1, 192-199). Fluorescence changes can also be used to 

1 5 monitor ligand-induced changes in membrane potential or intracellular pH; an 

automated system suitable for HTS has been described for these purposes (Schroeder 
etal (1996) J. Biomol Screening 1, 75-80). Melanophores prepared from Xenopus 
laevis show a ligand-dependent change in pigment organization in response to 
heterologous NgR activation; this response is adaptable to HTS formats 

20 (Jayawickreme etal (1997) Curr. Opin. Biotechnol 8, 629-634). Assays are also 
available for the measurement of common second messengers, including cAMP, 
phosphoinositides and arachidonic acid, but these are not generally preferred for HTS. 

Preferred methods of HTS employing these receptors include permanently 
transfected CHO cells, in which agonists and antagonists can be identified by the ability 

25 to transduce the signal for the binding of Nogo in membranes prepared from these cells 
through the putative GPI anchor. In another embodiment of the invention, 
permanently transfected CHO cells could be used for the preparation of membranes 
which contain significant amounts of the recombinant receptor proteins; these 
membrane preparations would then be used in receptor binding assays, employing the 

30 radiolabeled ligand specific for the particular receptor. Alternatively, a functional 
assay, such as fluorescent monitoring of ligand-induced changes in internal Ca 2+ 
concentration or membrane potential in permanently transfected CHO cells containing 
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each of these receptors individually or in combination would be preferred for HTS. 
♦ Equally preferred would be an alternative type of mammalian cell, such as HEK293 or 
COS cells, in similar formats. More preferred would be permanently transfected insect 
cell lines, such as Drosophila S2 cells. Even more preferred would be recombinant 
5 yeast cells expressing the Drosophila melanogaster receptors in HTS formats well 
known to those skilled in the art (e.g., Pausch (1997), above). 

The invention contemplates a multitude of assays to screen and identify 
inhibitors of ligand binding to NgR receptors. In one example, the Ngjl receptor is 
immobilized and interaction with a binding partner is assessed in the presence and 

10 absence of a candidate modulator such as an inhibitor compound. In another example, 
interaction between the NgR receptor and its binding partner is assessed in a solution 
assay, both in the presence and absence of a candidate inhibitor compound. In either 
assay, an inhibitor is identified as a compound that decreases binding between the NgR 
receptor and its binding partner. Another contemplated assay involves a variation of 

15 the di-hybrid assay wherein an inhibitor of protein/protein interactions is identified by 
detection of a positive signal in a transformed or transfected host cell, as described in 
PCT publication number WO 95/20652, published August 3, 1995. 

Candidate modulators contemplated by the invention include compounds 
selected from libraries of either potential activators or potential inhibitors. There are a 

20 number of different libraries used for the identification of small molecule modulators, 
including: (1) chemical libraries, (2) natural product libraries, and (3) combinatorial 
libraries comprised of random peptides, oligonucleotides or organic molecules. 
Chemical libraries consist of random chemical structures, some of which are analogs of 
known compounds or analogs of compounds that have been identified as "hits" or 

25 "leads" in other drug discovery screens, some of which are derived from natural 
products, and some of which arise from non-directed synthetic organic chemistry. 
Natural product libraries are collections of microorganisms, animals, plants, or marine 
organisms that are used to create mixtures for screening by: (1) fermentation and 
extraction of broths from soil, plant or marine microorganisms or (2) extraction of 

30 plants or marine organisms. Natural product libraries include polyketides, 

non-ribosomal peptides, and variants (non-naturally occurring) thereof. For a review, 
see Cane et al, Science (1998) 282, 63-68. Combinatorial libraries are composed of 
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large numbers of peptides, oligonucleotides, or organic compounds as a mixture. 
These libraries are relatively easy to prepare by traditional automated synthesis 
methods, PCR, cloning, or proprietary synthetic methods. Of particular interest are 
non-peptide combinatorial libraries. Still other libraries of interest include peptide, 
5 protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and 
polypeptide libraries. For a review of combinatorial chemistry and libraries created 
therefrom, see Myers (1997) Curr. Opin Biotechnol 8, 701-707. Identification of 
modulators through use of the various libraries described herein permits modification 
of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to modulate 
10 activity. 

Still other candidate inhibitors contemplated by the invention can be designed 
and include soluble forms of binding partners, as well as such binding partners as 
chimeric, or fusion, proteins. A "binding partner" as used herein broadly encompasses 
non-peptide modulators, as well as such peptide modulators as neuropeptides other 

15 than natural ligands, antibodies, antibody fragments, and modified compounds. 

comprising antibody domains that are immunospecific for the expression product of the 
identified NgR gene. 

Other embodiments of the invention comprise using competitive screening 
assays in which neutralizing antibodies capable of binding a polypeptide of the 

20 invention specifically compete with a test compound for binding to the polypeptide. In 
this manner, the antibodies can be used to detect the presence of any peptide that 
shares one or more antigenic determinants with NgR. Radiolabeled competitive 
binding studies are described in Lin et al, (1997) Antimicrob. Agents Chemother. 41, 
2127-213 1, the disclosure of which is incorporated herein by reference in its entirety. 

25 In other embodiments of the invention, the polypeptides of the invention are 

employed as a research tool for identification, characterization and purification of 
interacting, regulatory proteins. Appropriate labels are incorporated into the 
polypeptides of the invention by various methods known in the art and the 
polypeptides are used to capture interacting molecules. For example, molecules are 

30 incubated with the labeled polypeptides, washed to remove unbound polypeptides, and 
the polypeptide complex is quantified. Data obtained using different concentrations of 
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polypeptide are used to calculate values for the number, affinity, and association of 
polypeptide with the protein complex. 

Labeled polypeptides are also useful as reagents for the purification of 
molecules with which the polypeptide interacts including, but not limited to, inhibitors. 
5 In one embodiment of affinity purification, a polypeptide is covalently coupled to a 
chromatography column. Cells and their membranes are extracted, and various cellular 
subcomponents are passed over the column. Molecules bind to the column by virtue 
of their affinity to the polypeptide. The polypeptide-complex is recovered from the 
column, dissociated and the recovered molecule is subjected to protein sequencing. 
10 This amino acid sequence is then used to identify the captured molecule or to design 
degenerate oligonucleotides for cloning the corresponding gene from an appropriate 
cDNA library. 

Alternatively, compounds may be identified which exhibit similar properties to 
the ligand for the NgR of the invention, but which are smaller and exhibit a longer half 

15 time than the endogenous ligand in a human or animal body. When an organic 
compound is designed, a molecule according to the invention is used as a "lead" 
compound. The design of mimetics to known pharmaceutically active compounds is a 
well-known approach in the development of pharmaceuticals based on such "lead" 
compounds. Mimetic design, synthesis and testing are generally used to avoid 

20 randomly screening a large number of molecules for a target property. Furthermore, 
structural data deriving from the analysis of the deduced amino acid sequences 
encoded by the DNAs of the present invention are useful to design new drugs, more 
specific and therefore with a higher pharmacological potency. 

Comparison of the protein sequence of the present invention with the 

25 sequences present in all the available databases showed a significant homology with the 
transmembrane portion of G protein coupled receptors. Accordingly, computer 
modeling can be used to develop a putative tertiary structure of the proteins of the 
invention based on the available information of the transmembrane domain of other 
proteins. Thus, novel ligands based on the predicted structure of NgR can be 

30 designed. 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 



WO 02/29059 



PCT/US01/31488 



-88- 

Compositions and Pharmaceutical Compositions 

In a particular embodiment, the novel molecules identified by the screening 
methods according to the invention are low molecular weight organic molecules, in 
which case a composition or pharmaceutical composition can be prepared thereof for 
5 oral or parenteral administration. The compositions, or pharmaceutical compositions, 
comprising the nucleic acid molecules, vectors, polypeptides, antibodies and 
compounds identified by the screening methods described herein, typically comprise 
the nucleic acid molecule, protein, or antibody and a phannaceutically acceptable 
carrier. As used herein, "phannaceutically acceptable carrier" is intended to include 

10 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The nature of the carrier or other ingredients will depend on the 
specific route of administration and particular embodiment of the invention to be 
administered. Examples of techniques and protocols that are useful in this context are, 

15 inter alia , found in Remington's PHARMACEUTICAL SCIENCES, 16th ed., (1980) Osol, 
A (Ed.), which is incorporated herein by reference in its entirety. Preferred examples 
of such carriers or diluents include, but are not limited to, water, saline, Ringer's 
solution, dextrose solution and 5% human serum albumin. Liposomes and 
non-aqueous vehicles such as fixed oils may also be used. The use of such media and 

20 agents for pharmaceutically active substances is well known in the art. Except insofar 
as any conventional media or agent is incompatible with the active compound, use 
thereof in the compositions is contemplated. Supplementary active compounds can 
also be incorporated into the compositions. 

A pharmaceutical composition of the invention is formulated to be compatible 

25 with its intended route of administration. Examples of routes of administration include 
oral and parenteral (e.g., intravenous, intradermal, subcutaneous, inhalation, 
transdermal (topical), transmucosal and rectal administration). Solutions or 
suspensions used for parenteral, intradermal or subcutaneous application can include 
the following components: a sterile diluent such as water for injection, saline solution, 

30 fixed oils, polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 
antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
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acid; buffers such as acetates, citrates or phosphates, and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. The pH can be adjusted with acids or 
bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can 
be enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
5 plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersion. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

10 water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline (PBS). 
In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringeability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

15 example, water, ethanol, polyol (for example, glycerol, propylene glycol and liquid 
polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity 
can be maintained, for example, by the use of a coating such as lecithin, by the 
maintenance of the required particle size in the case of dispersion and by the use of 
surfactants. Prevention of the action of microorganisms can be achieved by various 

20 antibacterial and antifimgal agents, for example, parabens, chlorobutanol, phenol, 

ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such as manitol, sorbitol, sodium 
chloride in the composition. Prolonged absorption of the injectable compositions can 
be brought about by including in the composition an agent which delays absorption, for 

25 example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 
compound (e.g., a NgR protein or anti-NgR antibody) in the required amount in an 
appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by 

30 incorporating the active compound into a sterile vehicle that contains a basic dispersion 
medium and the required other ingredients from those enumerated above. In the case 
of sterile powders for the preparation of sterile injectable solutions, methods of 
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preparation are vacuum drying and freeze-drying that yields a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 
5 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients 
and used in the form of tablets, troches or capsules. Oral compositions can also be 
prepared using a fluid carrier for use as a mouthwash, wherein the compound in the 
fluid carrier is applied orally and swished and expectorated or swallowed. 
10 Pharmaceutical^ compatible binding agents, and/or adjuvant materials can be included 
as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel or corn starch; a lubricant 
such as magnesium stearate or Sterotes; a glidant such as colloidal silicon dioxide; a 
sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from pressured container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. 
For transmucosal or transdermal administration, penetrants appropriate to the barrier 
to be permeated are used in the formulation. Such penetrants are generally known in 
the art, and include, for example, for transmucosal administration, detergents, bile 
salts, and fusidic acid derivatives. Transmucosal administration can be accomplished 
through the use of nasal sprays or suppositories. For transdermal administration, the 
active compounds are formulated into ointments, salves, gels or creams as generally 
known in the art. 

The compounds can also be prepared in the form of suppositories {e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 
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In one embodiment, the active compounds are prepared with carriers that will 
protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
5 polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the 
art. The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutically 

10 acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,81 1 . It is 
especially advantageous to formulate oral or parenteral compositions in dosage unit 
form for ease of administration and uniformity of dosage. Dosage unit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 

15 treated; each unit containing a predetermined quantity of active compound calculated 
to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active 
compound and the particular therapeutic effect to be achieved. 

20 The nucleic acid molecules of the invention can be inserted into vectors and 

used as gene therapy vectors. Gene therapy vectors can be delivered to a subject by 
any of a number of routes, e.g., as described in U.S. Patent No. 5,703,055. Delivery 
can thus also include, e.g., intravenous injection, local administration (see U.S. Patent 
No. 5,328,470) or stereotactic injection (see e.g., Chen et al (1994) Proc. Natl Acad. 

25 Set USA 91, 3054-3057). The pharmaceutical preparation of the gene therapy vector 
can include the gene therapy vector in an acceptable diluent, or can comprise a slow 
release matrix in which the gene delivery vehicle is imbedded. Alternatively, where the 
complete gene delivery vector can be produced intact from recombinant cells, e.g., 
retroviral vectors, the pharmaceutical preparation can include one or more cells that 

3 0 produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack or 
dispenser together with instructions for administration. 
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The dosage of these low molecular weight compounds will depend on the 
disease state or condition to be treated and other clinical factors such as weight and 
condition of the human or animal and the route of administration of the compound. 
For treating human or animals, between approximately 0.5 mg/kg of body weight to 
5 500 mg/kg of body weight of the compound can be administered. Therapy is typically 
administered at lower dosages and is continued until the desired therapeutic outcome is 
observed. 

Another aspect of the present invention is the use of the NgR nucleotide 
sequences disclosed herein for identifying homologs of the Nogo-R, in other animals, 

10 including but not limited to humans and other mammals and invertebrates. Any of the 
nucleotide sequences disclosed herein, or any portion thereof, can be used, for 
example, as probes to screen databases or nucleic acid libraries, such as, for example, 
genomic or cDNA libraries, to identify homologs using screening procedures well 
known to those skilled in the art. Accordingly, homologs having at least 50%, more 

15 preferably at least 60%, more preferably at least 70%, more preferably at least 80%, 
more preferably at least 90%, more preferably at least 95%, and most preferably at 
least 100% homology with NgR sequences can be identified. 

The present compounds and methods, including nucleic acid molecules, 
polypeptides, antibodies, compounds identified by the screening methods described 

20 herein, have a variety of pharmaceutical applications and may be used, for example, to 
treat or prevent unregulated cellular growth, such as cancer cell and tumor growth. In 
a particular embodiment, the present molecules are used in gene therapy. For a review 
of gene therapy procedures, see e.g. Anderson Science (1992) 256, 808-813, which is 
incorporated herein by reference in its entirety. 

25 The present invention also encompasses a method of agonizing (stimulating) or 

antagonizing a NgR natural binding partner associated activity in a mammal comprising 
administering to said mammal an agonist or antagonist to one of the above disclosed 
polypeptides in an amount sufficient to effect said agonism or antagonism. One 
embodiment of the present invention, then, is a method of treating diseases in a 

30 mammal with an agonist or antagonist of the protein of the present invention 
comprising administering the agonist or antagonist to a mammal in an amount 
sufficient to agonize or antagonize NgR-associated functions. 
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Methods of determining the dosages of compounds to be administered to a 
patient and modes of administering compounds to an organism are disclosed in U.S. 
Application Serial No. 08/702,282, filed August 23, 1996, and International patent 
publication number WO 96/22976, published August 1, 1996, both of which are 
5 incorporated herein by reference in their entirety, including any drawings, figures or 
tables. Those skilled in the art will appreciate that such descriptions are applicable to 
the present invention and can be easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used and the size and physiological condition 

10 of the patient. Therapeutically effective doses for the compounds described herein can 
be estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC 50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

1 5 Plasma half-life and biodistribution of the drug and metabolites in the plasma, 

tumors and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. Such measurements can be carried out. For 
example, HPLC analysis can be performed on the plasma of animals treated with the 
drug and the location of radiolabeled compounds can be determined using detection 

20 methods such as X-ray, CAT scan and MRI. Compounds that show potent inhibitory 
activity in the screening assays, but have poor pharmacokinetic characteristics, can be 
optimized by altering the chemical structure and retesting. In this regard, compounds 
displaying good pharmacokinetic characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell 

25 composition. For example, toxicity studies can be carried out in a suitable animal 
model as follows: (1) the compound is administered to mice (an untreated control 
mouse should also be used); (2) blood samples are periodically obtained via the tail 
vein from one mouse in each treatment group; and (3) the samples are analyzed for red 
and white blood cell counts, blood cell composition and the percent of lymphocytes 

30 versus polymorphonuclear cells. A comparison of results for each dosing regime with 
the controls indicates if toxicity is present. 
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At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary 
Medical Association guidelines Report of the American Veterinary Medical Assoc. 
Panel on Euthanasia, (1993) J. Am. Vet Med. Assoc. 202:229-249). Representative 
5 animals from each treatment group can then be examined by gross necropsy for 

immediate evidence of metastasis, unusual illness or toxicity. Gross abnormalities in 
tissue are noted and tissues are examined histologically. Compounds causing a 
reduction in body weight or blood components are less preferred, as are compounds 
having an adverse effect on major organs. In general, the greater the adverse effect the 

10 less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 to 500 mg/day, preferably 1 to 250 mg/day, and 
most preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided 
plasma levels of the active moiety are sufficient to maintain therapeutic effectiveness. 

15 Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound the lower the plasma levels necessary to achieve efficacy. 

NgR mRNA transcripts have been found in the brain and heart. SEQ ID NOs: 
1 and/or, 3 will, as detailed above, enable screening the endogenous 
neurotransmitters/hormones/ligands which activate, agonize, or antagonize NgR and 

20 for compounds with potential utility in treating disorders including CNS disorders 
{e.g., stroke) and degenerative disorders such as those associated with demyelination. 

For example, NgR receptor activation may mediate the prevention of neurite 
outgrowth. Inhibition would be beneficial in both chronic and acute brain injury. See, 
e.g., Donovan et al, (1997) J. Neunosci. 17, 5316-5326; Turgeon et al, (1998) J. 

25 Neuroscl 18, 6882-6891; Smith-Swintosky etal, (1997),/ Neurochem. 69, 

1890-1896; Gill et al, (1998) Brain Res. 797, 321-327; Suidan etal, (1996) Semin. 
Thromb. Hemost 22, 125-133. 
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Pharmacogenomics 

Agents, or modulators that have a stimulatory or inhibitory effect on NgR 
activity (e.g., NgR gene expression), as identified by a screening assay described herein 
can be administered to individuals to treat (prophylactically or therapeutically) 
5 disorders (e.g., a disease condition such as a demyelination disorder) associated with 
aberrant NgR activity. In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an individual's genotype and that individual's 
response to a foreign compound or drug) of the individual may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 

10 failure by altering the relation between dose and blood concentration of the 

pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 
based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 

15 Accordingly, the activity of Ng;R protein, expression of NgR nucleic acid or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 

20 persons. See e.g., Eichelbaum (1996) Clin. Exp. Pharmacol. Physiol 23, 983-985 and 
Under (1997) Clin. Chem. 43, 254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body acts on drugs (altered drug 

25 metabolism). These pharmacogenetic conditions can occur either as rare defects or as 
polymorphisms. For example, glucose-6-phosphate dehydrogenase (G6PD) deficiency 
is a common inherited enzymopathy in which the main clinical complication is 
haemolysis after ingestion of oxidant drugs (anti-malarials, sulfonamides, analgesics, 
nitrofiirans) and consumption of fava beans. 

30 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 
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(NAT 2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an 
explanation as to why some patients do not obtain the expected drug effects or show 
exaggerated drug response and serious toxicity after taking the standard and safe dose 
of a drug. These polymorphisms are expressed in two phenotypes in the population, 
5 the extensive metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is 
different among different populations. For example, the gene coding for CYP2D6 is 
highly polymorphic and several mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 
quite frequently experience exaggerated drug response and side effects when they 

10 receive standard doses. If a metabolite is the active therapeutic moiety, PM show no 
therapeutic response, as demonstrated for the analgesic effect of codeine mediated by 
its CYP2D6-formed metabolite morphine. At the other extreme are the so called ultra- 
rapid metabolizers who do not respond to standard doses. Recently, the molecular 
basis of ultra-rapid metabolism has been identified to be due to CYP2D6 gene 

15 amplification. 

Thus, the activity of NgR protein, expression of NgR nucleic acid, or mutation 
content of NgR genes in an individual can be determined to thereby select appropriate 
agent(s) for therapeutic or prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping of polymorphic alleles 

20 encoding drug-metabolizing enzymes to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied to dosing or drug selection, 
can avoid adverse reactions or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a NgR modulator, such as a 
modulator identified by one of the exemplary screening assays described herein. 

25 

Monitoring Clinical Efficacy 

Monitoring the influence of agents (e.g., drugs, compounds) on the expression 
or activity of NgR (e.g., the ability to modulate aberrant cell proliferation and/or 
differentiation) can be applied not only in basic drug screening, but also in clinical 
30 trials. For example, the effectiveness of an agent determined by a screening assay as 
described herein to increase NgR gene expression, protein levels or upregulate NgR 
activity, can be monitored in clinical trials of subjects exhibiting decreased NgR gene 
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expression, protein levels, or downregulated NgR activity. Alternatively, the 
effectiveness of an agent determined by a screening assay to decrease NgR gene 
expression, protein levels, or downregulate NgR activity, can be monitored in clinical 
trials of subjects exhibiting increased NgR gene expression, protein levels, or 
5 upregulated NgR activity. In such clinical trials, the expression or activity of NgR and, 
preferably, other genes that have been implicated in, for example, a disease or disorder, 
can be used as a "read out" or markers of the immune responsiveness of a particular 
cell. 

For example, genes, including NgR, that are modulated in cells by treatment 
10 with an agent (e.g., compound, drug or small molecule) that modulates NgR activity 
(e.g., identified in a screening assay as described herein) can be identified. Thus, to 
study the effect of agents on demyelination disorders, for example, in a clinical trial, 
cells can be isolated and RNA prepared and analyzed for the levels of expression of 
NgR and other genes implicated in the disorder. The levels of gene expression (i.e., a 
15 gene expression pattern) can be quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the amount of protein produced by one 
of the methods as described herein or by measuring the levels of activity of NgR or 
other genes. In this way, the gene expression pattern can serve as a marker, indicative 
of the physiological response of the cells to the agent. Accordingly, this response state 
20 may be determined before, and at various points during, treatment of the individual 
with the agent. 

In one embodiment, the invention provides a method for monitoring the 
effectiveness of treatment of a subject with an agent (e.g., an agonist, antagonist, 
protein, peptide, peptidomimetic, nucleic acid, small molecule, or other drug candidate 

25 identified by the screening assays described herein) comprising the steps of (;) 

obtaining a pre-administration sample from a subject prior to administration of the 
agent; (if) detecting the level of expression of a NgR protein, mRNA, or genomic 
DNA in the preadministration sample; (Hi) obtaining one or more post-administration 
samples from the subject; (jv) detecting the level of expression or activity of the NgR 

30 protein, mRNA, or genomic DNA in the post-administration samples; (v) comparing 
the level of expression or activity of the NgR protein, mRNA or genomic DNA in the 
pre-administration sample with the NgR protein, mRNA or genomic DNA in the post 
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administration sample or samples; and (vi) altering the administration of the agent to 
the subject accordingly. For example, increased administration of the agent may be 
desirable to increase the expression or activity of NgR to higher levels than detected, 
i.e., to increase the effectiveness of the agent. Alternatively, decreased administration 
5 of the agent may be desirable to decrease expression or activity of NgR to lower levels 
than detected, i.e., to decrease the effectiveness of the agent. 



Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods 

10 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize (i.e., reduce or inhibit) activity. Therapeutics that 

1 5 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to; (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (fl) antibodies to a NgR 
peptide; (///) nucleic acids encoding a NgR peptide; (rv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (i.e., due to a heterologous 

20 insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
recombination (see, e.g., Capecchi (1989) Science 244, 1288-1292); or (v) modulators 
(i.e., inhibitors, agonists and antagonists, including additional peptide mimetic of the 
invention or antibodies specific to a peptide of the invention) that alter the interaction 

25 between a NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase {i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 

30 Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 
bioavailability. 
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Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g. , from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 
5 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 
In one aspect, the invention provides a method for preventing, in a subject, a 

10 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

15 agent can occur prior to the manifestation of symptoms characteristic of the NgR 
aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

20 Another aspect of the invention pertains to methods of modulating NgR 

expression or activity for therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 

25 protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 

peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 
or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 

30 Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro (e.g., by 
culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 
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agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent (e.g., an agent identified by a screening assay described 
5 herein), or combination of agents that modulates (e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

10 Gene Therapy 

Mutations in the NgR gene that result in loss of normal function of the NgR 
gene product underlie NgR human disease states. The invention comprehends gene 
therapy to restore Ngjl activity to treat those disease states. Delivery of a functional 
NgR gene to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, 

15 and more particularly viral vectors {e.g., adenovirus, adeno-associated virus, or a 
retrovirus), or ex vivo by use of physical DNA transfer methods {e.g., liposomes or 
chemical treatments). See, for example, Anderson (1998) Nature, supplement to 
392(6679):25-20. For additional reviews of gene therapy technology see Friedmann 
(1989) Science 244, 1275-1281; Verma (1990) Sci. Am. 68-84; and Miller (1992) 

20 Nature 357, 455-460. Alternatively, it is contemplated that in other human disease 
states, preventing the expression of, or inhibiting the activity of, NgR will be useful in 
.'treating disease states. It is contemplated that antisense therapy or gene therapy could 
be applied to negatively regulate the expression of NgR. 

The present invention provides for both prophylactic and therapeutic methods 

25 of treating a subject at risk of (or susceptible to) a disorder or having a disorder 
associated with aberrant NgR expression or activity. 

Diseases and disorders that are characterized by increased (relative to a subject 
not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that antagonize {i.e., reduce or inhibit) activity. Therapeutics that 

30 antagonize activity may be administered in a therapeutic or prophylactic manner. 

Therapeutics that may be utilized include, but are not limited to, (i) a NgR polypeptide, 
or analogs, derivatives, fragments or homologs thereof; (//) antibodies to a NgR 
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peptide; (Hi) nucleic acids encoding a NgR peptide; (zv) administration of antisense 
nucleic acid and nucleic acids that are "dysfunctional" (ie. s due to a heterologous 
insertion within the coding sequences of coding sequences to a NgR peptide) are 
utilized to "knockout" endogenous function of a NgR peptide by homologous 
5 recombination (see, e.g., Capecchi (1989) , above); or (v) modulators (i.e., inhibitors, 
agonists and antagonists, including additional peptide mimetic of the invention or 
antibodies specific to a peptide of the invention) that alter the interaction between a 
NgR peptide and its binding partner. 

Diseases and disorders that are characterized by decreased (relative to a subject 

10 not suffering from the disease or disorder) levels or biological activity may be treated 
with Therapeutics that increase (i.e., are agonists to) activity. Therapeutics that 
upregulate activity may be administered in a therapeutic or prophylactic manner. 
Therapeutics that may be utilized include, but are not limited to, a NgR peptide, or 
analogs, derivatives, fragments or homologs thereof; or an agonist that increases 

15 bioavailability. 

Increased or decreased levels can be readily detected by quantifying peptide 
and/or RNA, by obtaining a patient tissue sample (e.g., from biopsy tissue) and 
assaying it in vitro for RNA or peptide levels, structure and/or activity of the expressed 
peptides (or mRNAs of a NgR peptide). Methods that are well-known within the art 

20 include, but are not limited to, immunoassays (e.g., by Western blot analysis, 

immunoprecipitation followed by sodium dodecyl sulfate (SDS) polyacrylamide gel 
electrophoresis, immunocytochemistry, etc.) and/or hybridization assays to detect 
expression of mRNAs (e.g., Northern assays, dot blots, in situ hybridization, etc.). 
In one aspect, the invention provides a method for preventing, in a subject, a 

25 disease or condition associated with an aberrant NgR expression or activity, by 

administering to the subject an agent that modulates NgR expression or at least one 
NgR activity. Subjects at risk for a disease that is caused or contributed to by aberrant 
NgR expression or activity can be identified by, for example, any or a combination of 
diagnostic or prognostic assays as described herein. Administration of a prophylactic 

30 agent can occur prior to the manifestation of symptoms characteristic of the NgR 

aberrancy, such that a disease or disorder is prevented or, alternatively, delayed in its 
progression. Depending on the type of NgR aberrancy, for example, a NgR agonist or 
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NgR antagonist agent can be used for treating the subject. The appropriate agent can 
be determined based on screening assays described herein. 

Another aspect of the invention pertains to methods of modulating NgR 
expression or activity for therapeutic purposes. The modulatory method of the 
5 invention involves contacting a cell with an agent that modulates one or more of the 
activities of NgR protein activity associated with the cell. An agent that modulates 
NgR protein activity can be an agent as described herein, such as a nucleic acid or a 
protein, a naturally-occurring cognate ligand of a NgR protein, a peptide, a NgR 
peptidomimetic, or other small molecule. In one embodiment, the agent stimulates one 

10 or more NgR protein activity. Examples of such stimulatory agents include active 
NgR protein and a nucleic acid molecule encoding NgR that has been introduced into 
the cell. In another embodiment, the agent inhibits one or more NgR protein activity. 
Examples of such inhibitory agents include antisense NgR nucleic acid molecules and 
anti-NgR antibodies. These modulatory methods can be performed in vitro {e.g., by 

15 culturing the cell with the agent) or, alternatively, in vivo {e.g., by administering the 
agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a NgR protein or nucleic acid molecule. In one embodiment, the method 
involves administering an agent {e.g., an agent identified by a screening assay described 

20 herein), or combination of agents that modulates {e.g., upregulates or downregulates) 
NgR expression or activity. In another embodiment, the method involves 
administering a NgR protein or nucleic acid molecule as therapy to compensate for 
reduced or aberrant NgR expression or activity. 

The present invention is not to be limited in scope by the specific embodiments 

25 described herein. Indeed, various modifications of the invention in addition to those 
described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying figure. Such modifications are intended to fall within 
the scope of the appended claims. 

The following Table 5 contains the sequences of exemplary polynucleotides 

30 and polypeptides of the invention. 
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TABLES 

The following DNA sequence NgR2 <SEQ ID NO. 1> was identified in humans: 



10 



15 



20 



25 



30 



35 



40 



ATGCTGCCCGGGCTCAGGCGCCTGCTGCAAGCTCCCGCCTCGGCCTGCCTCCTGCTGATG 
CTCCTGGCCCTGCCCCTGGCGGCCCCCAGCTGCCCCATGCTCTGCACCTGCTACTCATCC 
CCGCCCACCGTGAGCTGCCAGGCCAACAACTTCTCCTCTGTGCCGCTGTCCCTGCCACCC 
AGCACTCAGCGACTCTTCCTGCAGAACAACCTCATCCGCACGCTGCGGCCAGGCACCTTT 
GGGTCCAACCTGCTCACCCTGTGGCTCTTCTCCAACAACCTCTCCACCATCTACCCGGGC 
ACTITCCGCCACTTGCAAGCCCTGGAGGAGCTGGACCTCGGTGACAACCGGCACCTGCGC 
TCGCTGGAGCCCGACACCTTCCAGGGCCTGGAGCGGCTGCAGTCGCTGCATTTGTACCGC 
TGCCAGCTCAGCAGCCTGCCCGGCAACATCTTCCGAGGCCTGGTCAGCCTGCAGTACCTC 
TACCTCCAGGAGAACAGCCTGCTCCACCTACAGGATGACTTGTTCGCGGACCTGGCCAAC 
CTGAGCCACCTCTTCCTCCACGGGAACCGCCTGCGGCTGCTCACAGAGCACGTGTTTCGC 
GGCCTGGGCAGCCTGGACCGGCTGCTGCTGCACGGGAACCGGCTGCAGGGCGTGCACCGC 
GCGGCCTTCCGCGGCCTCAGCCGCCTCACCATCCTCTACCTGTTCAACAACAGCCTGGCC 
TCGCTGCCCGGCGAGGCGCTCGCCGACCTGCCCTCGCTCGAGTTCCTGCGGCTCAACGCT 
AACCCCTGGGCGTGCGACTGCCGCGCGCGGCCGCTCTGGGCCTGGTTCCAGCGCGCGCGC 
GTGTCCAGCTCCGACGTGACCTGCGCCACCCCCCCGGAGCGCCAGGGCCGAGACCTGCGC 
GCGCTCCGCGAGGCCGACTTCCAGGCGTGTCCGCCCGCGGCACCCACGCGGCCGGGCAGC 
CGCGCCCGCGGCAACAGCTCCTCCAACCACCTGTACGGGGTGGCCGAGGCCGGGGCGCCC 
CCAGCCGATCCCTCCACCCTCTACCGAGATCTGCCTGCCGAAGACTCGCGGGGGCGCCAG 
GGCGGGGACGCGCCTACTGAGGACGACTACTGGGGGGGCTACGGGGGTGAGGACCAGCGA 
GGGGAGCAGATGTGCCCCGGCGCTGCCTGCCAGGCGCCCCCGGACTCCCGAGGCCCTGCG 
CTCTCGGCCGGGCTCCCCAGCCCTCTGCTTTGCCTCCTGCTCCTGGTGCCCCACCACCTC 



The following amino acid sequence <SEQ ID NO. 2> is the predicted amino acid 

sequence derived from the DNA sequence of SEQ ID NO. 1 : 

MLPGLRRLLQAPASACLLLMLLALPLAAPSC 

PMLCTCYSSPPTVSCQANNFSSVPLSLPPST 

QRLFLQNNLIRTLRPGTFGSNLLTLWLFSNN 

LSTIYPGTFRHLQALEELDLGDNRHLRSLEP 

DTFQGLERLQSLHLYRCQLS SLPGNIFRGLV 

SLQYLYLQENSLLHLQDDLFADLANLSHLFL . 

HGNRLRLLTEHVFRGLGSLDRLLLHGNRLQG 

VHRAAFRGLSRLTILYLFNNSLASLPGEALA 

DLPSLEFLRLNANPWACDCRARPLWAWFQRA 

RVSSSDVTCATPPERQGRDLRALREADFQAC 

PPAAPTRPGSRARGNSS SNHLYGVAEAGAPP 

ADPSTLYRDLPAED SRGRQ GGDAPTEDDYWG 

GYGGEDQRGEQMCPGAACQAPPDSRGPALSA 

GLP SPLLCLLLL VPHHL 
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The following DNA sequence NgR3 <SEQ ID NO. 3> was identified in mouse: 

atgtcttggcagtctggaaccacagtgacacaatctcccgtgcaggctgctcaggtctca 
gggtgctgtgtggaattgctgk:tgttgctgctcgk:tggagagctacctctgggtggtggt 
tgtcctcgagactgtgtgtgctaccctgcgcccatgactgtcagctgccaggcacacaac 
tttgctgccatcccggagggc^tcccagaggacagtgagcgcatcttcctgcagaacaat 
cgcatcaccttcctccagcagggccacttcagccccgccatggtcaccctctggatctac 
tccaacaacatcactttcattgctcccaacaccttcgagggctttgtgcatctggaggag 
ctagaccttggagacaaccgacagctgcgaacgctggcacccgagaccttccaaggcctg 
gtgaagcttcacgccctctacctctataagtgtggactgagcgccctgcccgcaggcatc 
tttggtggcctgcacagcctgcagtatctctacttgcaggacaaccatatcgagtacctc 
caagatgacatctttgtggacctggtcaatctcagtcacttgtttctccatggtaacaag 
ctatggagcctgggccaaggcatcttccggggcctggtgaacctggaccggttgctgctg 
catgagaaccagctacagtgggttcaccacaaggctttccatgacctccacaggctaacc 
accctctttctcttcaacaacagcctcactgagctgcagggtgactgtctggcccccctg 
gtggccttggagttccttcgcctcaatgggaatgcttgggactgtggctgccgggcacgt 
tccctgtgggaatggctgcgaaggttccgtggctctagctctgctgtcccctgcgcgacc 
cccgagctgcckk:aaggccaggatctgaagctck:tgagggtggaggacttccggaactgc 
acaggaccagtgtctcctcaccagatcaagtctcacacgcttaccacctctgacagggct 
gcccgcaaggagcaccatccgtcccatggggcctccagggacaaaggccacccacatggc 
catccgcctggctccaggtcaggttacaagaaggcaggcaagaactgcaccagccacagg 
aaccggaaccagatctctaaggtgagctctgggaaagagcttaccgaactgcaggactat 
gcccccgactatcagcacaagttcagctttgacatcatgcccaccgcacgacccaagagg 

AAGGGCAAGTGTGCTCGCAGGACCCCCATCCGTGCCCCCAGTGGGGTG<^GCACKjCATCC 
TCAGGCACGGCCCTTGGGGCCCCACTCCTGGCCTGGATACTGGGGCTGGCAGTCACTCTC 
CGC 
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The following protein sequence <SEQ ID NO. 4> is deduced protein of SEQ ID 
NO:3; 

MSWQSGTTVTQSPVQAAQVSGCCVELLLLLL 
AGELPLGGGCPRDCVCYPAPMTVSCQAHNFA 
AIPEGIPEDSERIFLQNNRITFLQQGHFSPA 
MVTLWIYSNNI T FI APNTFE GF VHLEE LD L G 
DNRQLRTLAPETFQGLVKLHALYLYKCGLSA 
LPAGIFGGLHSLQYLYLQDNHIEYLQDDIFV 
DL VNLSHLFLHGNKLWSLGQGIFRGLVNLDR 
LLLHENQLQ W VHHK AFHDLHRLTTLFLFNNS 
LTELQGDCLAPLVALEFLRLNGNAWDCGCRA 
RSLWEWLRRFRGSSSAVPCATPELRQGQDLK 
LLRVEDFRN CT GP VSPHQIKS HT LT T S D R A A 
RKEHHPSHGASRDKGHPHGHPPGSRSGYKKA 
GKNCTSHRNRNQISKVSSGKELTELQD YAPD 
YQHKFSFDIMPT ARPKRKGKC ARRTPIR APS 
GVQQASSGTALGAPLLAWILGLAVTLR 
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The following protein sequence <SEQ ID NO. 5> is NgRl from humans: 

MKRAS AGGSRLLAWVLWLQAWQVAAPCPGA 
C 

CYNEPKVTTSCPQQGLQAVPVGIPAASQRI 
FLHGNRISHVPAASFRACRNLTILWLHSNVL 
ARIDAAAFTGLALLEQLDLSDNAQLRSVDPA 
TFHGLGRLHTLHLDRCGLQELGPGLFRGLAA 
LQYLYLQDNALQALPDDTFRDLGNLTHLFLH 
GNRISSVPERAFRGLHSLDRLLLHQNRVAHV 
HPHAFRDLGRLMTLYLFANNLS ALPTEALAP 
LR ALQ YLRLNDNP W VCD CR ARPL W A WLQKFR 
G S S SE VP C SLP QRL A GRD LKRL A ANDL QGC A 
VATGPYHPIWTGRATDEEPLGLPKCCQPD A A 
DKAS VLEPGRPAS AGN ALKGRVPPGD SPPGN 
GSGPRHINDSPFGTLPGSAEPPLTAVRPEGS 
EPPGFPTSGPRRRPGCSRKNRTRSHCRLGQ A 
GSGGGGTGDSEGSGALPSLTCSLTPLGLALV 
LWTVLGPC 

The following amino acid sequence <SEQ ID NO:6> is a Consensus Sequence of 
NgR based on homology with NgRl 

CPXXCXCYXXPXXTXSCXXXXXXXXPX 
XXPXXXXRXFLXXNXIXXXXXXXFXXXXXXXXLWX 
XSNXXXXIXXXXFXXXXXLEXLDLXDNXXLR 
XXXPXTFXGLXXLXLXLXXCXLXXLXXXXFX 
GLXXLQYLYLQXNXXXXLXDDXFXDLXNLXH 
LFLHGNXXXXXXXXXFRGLXXLDRLLLHXNX 
XXXVHXXAFXXLXRLXXLXLFXNXLXXLXXX 
XLAXLXXLXXLRLNXNXWXCXCRARXLWXWX 
XXXRXSSSXVXCXXPXXXXGXDLXXLXXXDX 
XXCXXXXXPXXPXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXGXXXXXXXXXXXX 
PPXXXSXXXXXXXXXXXXXXXXXXXXXXXXX 
XXXXXXXXXXXXXXXXXXXXXXXXXXXRXXX 
XXXXXXXXXXXXXXXXXXXXXXXXLXXXXX 
XX X XX L 



The following protein sequence <SEQ ID NO:7> is the 66 amino acid active domain 
ofNogo: 

RIYKGV1QAIQKSDEGHPFRAYLESEVAISE 
ELVQKYSNSALGHVNCTIKELRRLFLVDDLV 
DS L K 
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The following protein sequence <SEQ ID NO:8> is the amino acid sequence of the 
mature NgR2: 

CPMLCTCYSSPPTVSCQ.ANNFSSVPLSLPPS 
TQRLFLQNNLIRTLRPGTFGSNLLTLWLFSN 
NLSTIYPGTFRHLQALEELDLGDNRHLRSLE 
PDTFQGLERLQSLHLYRCQLSSLPGNIFRGL 
VSLQYLYLQENSLLHLQDDLFADLANL SHLF 
LHGNRLRLLTEHVFRGLGSLDRLLLHGNRLQ 
GVHRAAFRGLSRLTILYLFNNSLASLPGEAL 
ADLPSLEFLRLNANPWACD CRARPLW AWFQR 
ARVSSSDVTCATPPERQGRDLRALREADFQA 
CPPAAPTRPGSRARGNSSSNHLYGVAEAGAP 
PADPSTLYRDLPAEDSRGRQGGDAPTEDDYW 
GGYGGEDQRGEQMCPGAACQAPPD SRGPALS 
AGLPSPLLCLLLL VP H H L 



The following protein sequence <SEQ ID NO:9> is the amino acid sequence of the 
mature NgR3: 

CPRDCVCYPAPMTVSCQAHNFAAIPEGIPED 
SERIFLQNNRITFLQQGHFSPAMVTLWIYSN 
NITFIAPNTFEGFVHLEELDLGDNRQLRTLA 
PETFQGLVKLHALYLYKCGLSALPAGIFGGL 
HSLQYLYLQDNHIEYLQDDIFVDLVNLSHLF 
LHGNKLWSLGQGIFRGLVNLDRLLLHENQLQ 
WVHHKAFHDLHRLTTLFLFNNSLTELQGDCL 
APLVALEFLRLNGNAWDCGCRARSLWEWLRR 
FRGSSSAVPCATPELRQGQDLKLLRVEDFRN 
CTGPVSPHQIKSHTLTTSDRAARKEHHPSHG 
ASRDKGHPHGHPPGSRSGYKKAGKNCTSHRN 
RNQISKVSSGKELTELQDYAPDYQHKFSFDI 
MPTARPKRKGKCARRTPIRAPSGVQQASSGT 
A L G A PL L AW I L G L A VT LR 



The following amino acid sequence <SEQ ID NO: 10> is a conserved cysteine motif 
(Cysteine domain 1) of the NgR and homologs based on the Consensus Sequence: 
CPXXCXCYXXPXXTXSC 
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The following amino acid sequence <SEQ ID NO: 1 1> is a conserved cysteine motif 

(Cysteine domain 2) of the NgR and homologs based on the Consensus Sequence: 

NXWXCXCRARXLWXWXXXXRXSSSXVXCXXP 
X X X XGXDLXXLXXXDXXXC 

5 



The following amino acid sequence <SEQ ID NO: 12> is a conserved Leucine-rich 

domain of the NgR and homologs based on the Consensus Sequence: 

RXFLXXNXIXXXXXXXFXXXXXXXXLWXXSN 
10 XXXXIXXXXFXXXXXLEXLDLXDNXXLRXXX 
PXTFXGLXXLXLXLXXCXLXXLXXXXFXGLX 
XLQYLYLQXNXXXXLXDDXFXDLXNLXHLFL 
HGNXXXXXXXXXFRGLXXLDRLLLHXNXXXX 
VHXXAFXXLXRLXXLXLFXNXLXXLXXXXLA 
15 X L XX L XX LR L 



Unless otherwise indicated, X is any amino acid. For example, X where 
indicated may be no amino acid. Additional features of the invention will be apparent 
from the following Examples. Examples 1-5 are actual, while the remaining Examples 

20 are prophetic. 

As shown by the following Examples, a gene encoding novel NgRs have been 
identified by computational analysis of DNA sequence data. The proteins encoded by 
NgR2 and NgR3 have a putative signal sequence, eight leucine-rich repeat domains in 
a conserved leucine-rich region (SEQ ID NO: 12), a conserved cysteine-rich region 

25 (SEQ ID NO: 10) N-terrainal to the leucine-rich region, a second cysteine-rich domain 
(SEQ ID NO: 1 1) C-tenninal to the leucine-rich region, and a putative 
glycophosphatidylinositol-linkage (GPI-linkage) site. NgR2 and NgR3 differ from the 
previously identified NgR sequence. The NgR homologs, when compared to known 
NgRs, show a consensus sequence (SEQ ID NOs:6). The putative mature NgR2 and 

30 NgR3 are shown in Table 5 as SEQ ID NOs: 8 and 9, respectively. 

Example 1: Tblastn query of the HTG database 

The protein sequence for the human NgR (NgRl) (SEQ ID NO:5) was used to 
query the high throughput genomic (HTG) database the use of which is familiar to 
35 those skilled in the art. The HTG database is a part of GenBank, a comprehensive 
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NIH genetic sequence database, which includes an annotated collection of all publicly 
available DNA sequences {Nucleic Acids Res. (2000) 28, 15-8). The HTG database 
includes sequences obtained from genomic DNA. Within genomic DNA, genes are 
typically encoded by multiple segments of DNA called exons. Thus when one aligns a 
5 cDNA sequence (or a protein sequence encoded by a cDNA sequence) to a genomic 
sequence, the sequence will be broken up into segments depending on the number of 
exons in the gene. 

The BLAST algorithm, which stands for Basic Local Alignment Search Tool is 
suitable for determining sequence similarity (Altschul et al, (1990) J. Mol Biol 215, 

10 403-410, which is incorporated herein by reference in its entirety). Software for 
performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (http://www.ncbi.nlm.nih.gov/). The basic BLAST 
algorithm involves first identifying high scoring sequence pair (HSPs) by identifying 
short words of length W in the query sequence that either match or satisfy some 

15 positive-valued threshold score T when aligned with a word of the same length in a 
database sequence. T is referred to as the neighborhood word score threshold 
(Altschul et al, si/pra). These initial neighborhood word hits act as seeds for initiating 
searches to find HSPs containing them. The word hits are extended in both directions 
along each sequence for as far as the cumulative alignment score can be increased. 

20 Extension for the word hits in each direction are halted when: 1) the cumulative 

alignment score falls off by the quantity X from its maximum achieved value; 2) the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or 3) the end of either sequence is reached. The 
Blast algorithm parameters W, T and X determine the sensitivity and speed of the 

25 alignment. The Blast program uses as defaults a word length (W) of 1 1, the 

BLOSUM62 scoring matrix (see Henikoff a/., (1992) Proc. Natl Acad Set USA 
89, 10915-10919, which is incorporated herein by reference in its entirety) alignments 
(B) of 50, expectation (E) of 10, M=5, N=4, and a comparison of both strands. 

The BLAST algorithm (Karlin etal, (1993) Proc. Natl Acad Set USA 90, 

30 5873-5787, which is incorporated herein by reference) and Gapped BLAST perform a 
statistical analysis of the similarity between two sequences. One measure of similarity 
provided by the BLAST algorithm is the smallest sum probability (P(N)), which 
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provides an indication of the probability by which a match between two nucleotide or 
amino acid sequences would qccur by chance. For example, a nucleic acid is 
considered similar to a NgR gene or cDNA if the smallest sum probability in 
comparison of the test nucleic acid to a NgR nucleic acid is less than about 1, 
5 preferably less than about 0. 1, more preferably less than about 0.01, and most 
preferably less than about 0.001. 

To query the HTG database with the NgR protein sequence, we used a 
variation of the BLAST algorithm known as the tblastn program, which compares a 
protein query sequence against a nucleotide sequence database dynamically translated 

10 in all reading frames (J. Mol Biol (1990) 215, 403-410: Nucleic Acids Res. (1997) 
25, 3389-3402). The results of the tblastn search indicated the presence of genes in 
the database with a significant identity to the NgR. In addition to finding hits to 
genomic clones which contain the human and mouse NgR genes, we found hits to 
clones where the identity was not as high, but still very significant. Three human 

15 clones were found (Accession numbers: AC068514, AC016869, AC013606) with an 
e-value of 4e-43 and one mouse clone was found (Accession No. AC021768) with an 
e-value of le-78. The three human clones all appeared to encode the same gene, so 
further analysis was confined to AC013606. 

20 Example 2: Prediction of the human NgR2 protein sequence (AC013606) 

The human NgR protein sequence aligned with two regions of translated 
sequence from nucleotide sequence AC013606, indicating that the new gene was 
encoded by at least two exons. In order to define the complete gene, we used the 
computer program GENSCAN™ (J. Mol Biol (1997) 268, 78-94) which can identify 

25 complete exon/intron structures of genes in genomic DNA The gene prediction by 
GENESCAN™ contained seven exons. By comparing these predicted exons to the 
NgR, it was concluded that the new human gene contains two of these exons and a 
part of another (containing the initiating methionine). The predicted cDNA (mRNA) 
encoded by these three exons was assembled from AC013606 (HTG11; deposited 

30 March 2000; length - 143899; GenBank release 118.0; SEQ ID NO: 15) by combining 
nucleotides from the three exons whose coordinates are: 123292-123322 (exon 1); 
130035-130516 (exon 2); and 138589-139335 (exon 3). The sequence for this cDNA 
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sequence is SEQ ID NO:l (nucleotide sequence of human NgR2; AC013606). The 
translation of this cDNA provides the protein sequence of human NgR2 (SEQ ID 
NO:2). 

We used the protein sequence of human NgR2 as a query sequence against the 
5 human EST database. A number of hits of high significance were found indicating that 
the NgR2 mRNA is expressed in a number of tissues including fetal brain. 
Furthermore, two of these ESTs provided support for the exon structure that we 
deduced. One EST (Accession No: GBJEST19:AI346757) contains 565 nucleotides 
corresponding to amino acids 84-271 of the human NgR2 (SEQ ID No:4). This spans 

10 the second intron located between amino acids 171 and 172, and provides positive 
evidence for the splicing of exons 2 and 3 at the mRNA level. Another EST 
(GB_EST26:AI929019) contains 545 nucleotides, part of which corresponds to amino 
acids 1-75 of the human NgR2 (SEQ ID NO:2). This spans the first intron located 
between amino acids 10 and 1 1, and provides positive evidence for the splicing of 

1 5 exons 1 and 2 at the mRNA level. 

Example 3: Prediction of the mouse NgR3 protein sequence (AC021768) 

The human NgR protein sequence aligned with only one region of translated 
sequence from nucleotide sequence AC021768, indicating that most of the new mouse 

20 gene was encoded by one large exon. However, upon inspection, the protein encoded 
by this exon was missing an initiating methionine. In order to define the complete 
gene, we used the computer program GENSCAN as described above. The gene 
prediction by GENSCAN contained two exons; the large one found by visual 
inspection and a short one at the 5' end which provided an initiating methionine. The 

25 predicted cDNA (mRNA) encoded by these two exons was assembled from AC021768 
(HTG14; deposited March 2000; length = 215980; GenBank release 118.0; SEQ ID 
NO: 16) by combining nucleotides from the two exons whose coordinates are: the 
complement of 164265-164325 (exon 1); and the complement of 155671-156992 
(exon 2). The sequence for this cDNA sequence is SEQ ID NO:3 (nucleotide 

30 sequence of mouse NgR3; AC021768). The translation of this cDNA provides the 
protein sequence of mouse NgR3 (SEQ ID NO:4). 



WO 02/29059 



PCT/US01/31488 



-111- 

We used the protein sequence of mouse NgR3 as a query sequence against the 
mouse EST database. One hit of high significance was found indicating that the NgR2 
mRNA is expressed in the heart. This EST (GB_EST20:AI428334) contains 463 
nucleotides, part of which correspond to amino acids 45-193 of mouse NgR3 (SEQ ID 
5 NO:4). 

Example 4: Similarity between the NgRs 

An alignment between NgRl and the two new receptors is shown in Fig. 
1 A-1B. The similarities between these proteins include: 
10 (1) The SignalP program, which locates the signal sequence cleavage 

position, predicts a cleavage before the first conserved cysteine in all the proteins. 
Thus the mature protein in all cases will have a cysteine at the N-terminus. 

(2) All proteins contain eight Leucine Rich Repeats (LRR). LRRs are short 
sequence motifs present in a number of proteins with diverse functions and cellular 

15 locations. These repeats are usually involved in protein-protein interactions. Each 
LRR is composed of a beta-alpha unit. 

(3) All three proteins contain a leucine rich repeat N-terminal domain 
(LRRNT), in which four cysteines are conserved. LRRs are often flanked by cysteine 
rich domains at both their N and C termini. 

20 (4) All three proteins contain a LRR C-terminal domain (LRRCT). The 

LRRCTs of the three NgR proteins can be distinguished from those of other LRR 
containing proteins, by the pattern of typtophans and cysteines which are completely 
conserved in this domain. 

(5) All three proteins contain a conserved cysteine in the fourth LRR 
25 domain. 

(6) All three proteins contain a conserved potential glycosylation site in the 
eighth LRR domain. 

(7) NgR2 and NgR3 have a hydrophobic C-terminus, as does NgRl, an 
indication that they probably also undergo a modification similar to NgRl, where a 

30 GPI moiety is covalently linked to a C-terminal amino acid. This allows the protein to 
remain tethered to the cell. 
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Example 5: Preparation of Nogo Proteins 

ANogo binding assay was developed which utilizes a method widely used in 
examining semaphorin and ephrin axonal guidance function (Flanagan & 
Vanderhaeghen (1998) Annu. Rev. Neurosci. 21,3 09-345; Takahashi etal. y (1999) 
5 Cell 99, 59-69), It involves fusing a secreted placental alkaline phosphatase (AP) 

moiety to the ligand in question to provide a biologically active receptor binding agent 
which can be detected with an extremely sensitive colorimetric assay. For Nogo, an 
expression vector is created encoding a signal peptide, a His6 tag for purification, AP, 
and the 66 amino acid active domain of Nogo. The fusion protein can be purified from 

10 the conditioned medium of transfected cells in milligram amounts. This protein is 
biologically active as a growth cone collapsing agent with an EC 50 of 1 nM. 

Alternatively, a glutathione-S-transferase Nogo (GST-Nogo) fusion protein 
may be prepared. For GST-Nogo, an expression vector (e.g., a pGEX vector) is 
created encoding a signal peptide, GST, and the 66 amino acid active domain of Nogo. 

15 GST-Nogo may be purified from the culture medium and used as a GST fusion 

protein, or GST may be cleaved from the Nogo portion of the fusion protein with an 
enzyme that recognizes the specific amino acid cleavage sit engineered between the 
GST portion and the Nogo portion of the fusion protein. Such sites are part of the 
commercially available GST vectors. The specific cleavage sites and enzymes may be 

20 used in accordance with the Manufacturer's specifications. 

It has been found that AP-Nogo is actually slightly more potent than 
GST-Nogo, perhaps because the protein is synthesized in a eukaryotic rather than a 
prokaryotic cell. 

Binding of Nogo to immobilized NgR homologs may be performed in an 
25 ELISA-type assay in which AP-Nogo is allowed to react with an immobilized receptor 
homolog. Specificity of binding may be demonstrated in a competitive binding assay 
using increasing amounts of GST-Nogo in the type of assay to show a decreasing 
amount of binding of AP-Nogo (as judged in the colorimetric assay). 



30 



Example 6: Transfected COS Cell binding Assays 

The homologs of the present invention may be used in transfection studies in 
COS cells to demonstrate binding of Nogo. Specifically, nucleotide sequences 



WO 02/29059 



PCT/US01/31488 



-113- 

encoding NgR2 and NgR3 may be transfected into COS cells using a suitable vector. 
Non-transfected COS-7 cells do not bind AP-Nogo. However, transfection of COS 
cells with nucleic acid sequences encoding NgRs will make them capable of binding 
Nogo. AP alone does not bind with any stable affinity to these transfected cells, 
5 indicating that any affinity of Nogo for NgR2 or NgR3 would be due to the 66 amino 
acids derived from Nogo. Furthemore, specific affinity of Nogo for the Ngjl2 or 
NgR3 proteins may be tested in displacement of AP-Nogo assays using GST-Nogo. 
NgR2 and/or NgR3 may also bind homologs of Nogo, which may also be tested using 
this assay. 

10 

Example 7: Expression of NgR in Human Cell Lines using Northern Blot and a 
Random-Primed Probe 

A Northern blot is purchased from a commercial source, or RNA samples from 
cells of interest are run on an agarose gel and blotted to a membrane using any of the 

1 5 well known techniques for Northern blotting. The blot is probed with a fragment of 
NgR2 (SEQ ID NO: 1) or NgR3 (SEQ ID NO:3). The probe is prepared from 50 ng 
of cDNA labeled by a random-primed method (Feinberg and Vogelstein (1983) Anal 
Biochem. 132, 6-13). Hybridization is carried out at 68'C for 1 hour in 
ExpressHybTM solution (Clontech, Cat. No. 8015-1) followed by washing with 2X 

20 SSC/0.05% SDS at room temperature and two washes with 0. IX SSC/0. 1% SDS at 
50°C. Expression of NgR2 and/or NgR3 can be assessed by the presence of an 
appropriately sized band on the blot. 

Example 8: Cloning of cDNA corresponding to NgRs 

25 

To obtain the full-length clone corresponding to NgR2 from a cDNA library, 
the following method may be used. A cDNA library is generated using standard 
methods from a tissue known to contain NgR2. Such a tissue was identified in 
Example 2. 1 x 10 6 plaque forming units from the cDNA library may be screened in 
30 duplicate on OPTITRAN™ filters. The filters are hybridized with 32 P-labeled 

oligonucleotides that are generated from the ESTs corresponding to portions of NgR2. 
The hybridization reaction may consist of 400 mis plaque screen buffer (50mM Tris pH 
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7.5, 1M NaCl, 0. 1% Sodium pyrophosphate, 0.2% Polyvinylpryolidine and 0.2% 
Ficoll) containing 10% Dextran sulfate and 100ng/ml tRNA and 80 pmol each 
32 P-labeled oligonucleotide at 65°C overnight. The filters are washed twice with 2X 
SSC/1%SDS and twice with IX SSC/1%SDS and exposed to film. Duplicate positives 
5 are purified. DNA from each of these clones is analyzed by restriction enzyme digest 
followed by agarose gel electrophoresis and Southern blotting. The filters are 
hybridized to the P-Iabeled oligonucleotides used for the original hybridization to 
confirm that inserts hybridize to the probe. The insert is then sequenced to confirm that 
it represents the cDNA for NgR2. Similar methods may be used to generate a 
1 0 full-length clone corresponding to Ngjl3 . 

Alternatively, a full-length clone of NgR2 or NgR3 can be obtained by a 
person of ordinary skill in the art employing conventional PCR techniques. 

Example 9: Hybridization Analysis to demonstrate NgR expression in the brain 

15 The expression of NgR in mammals, such as the rat, may be investigated by in 

situ hybridization histochemistry. To investigate expression in the brain, for example, 
coronal and sagittal rat brain cryosections (20 fim thick) are prepared using a Reichert- 
Jung cryostat. Individual sections are thaw-mounted onto silanized, nuclease-free 
slides (CEL Associates, Inc., Houston, TX), and stored at -80 B C. Sections are 

20 processed starting with post-fixation in cold 4% paraformaldehyde, rinsed in cold 

phosphate-buffered saline (PBS), acetylated using acetic anhydride in triethanolamine 
buffer, and dehydrated through a series of alcohol washes in 70%, 95%, and 100% . 
alcohol at room temperature. Subsequently, sections are delipidated in chloroform, 
followed by rehydration through successive exposure to 100% and 95% alcohol at 

25 room temperature. Microscope slides containing processed cryosections are allowed 
to air dry prior to hybridization. Other tissues may be assayed in a similar fashion. 

A NgR-specific probe may be generated using PCR. Following PCR 
amplification, the fragment is digested with restriction enzymes and cloned into 
pBluescript II cleaved with the same enzymes. For production of a probe specific for 

30 the sense strand of NgR, a cloned NgR fragment cloned in pBluescript II may be 
linearized with a suitable restriction enzyme, which provides a substrate for labeled 
run-off transcripts (i.e., cRNA riboprobes) using the vector-borne T7 promoter and 



i 
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commercially available T7 RNA polymerase. A probe specific for the antisense strand 
of NgR may also be readily prepared using the NgR clone in pBluescript II by cleaving 
the recombinant plasmid with a suitable restriction enzyme to generate a linearized 
substrate for the production of labeled run-off cRNA transcripts using the T3 promoter 
5 and cognate polymerase. The riboprobes may be labeled with [ 35 S]-UTP to yield a 
specific activity of about 0.40 x 10 6 cpm/pmol for antisense riboprobes and about 0.65 
x 10 6 cpm/pmol for sense-strand riboprobes. Each riboprobe may be subsequently 
denatured and added (2 pmol/ml) to hybridization buffer which contains 50% 
formamide, 10% dextran, 0.3 MNaCl, 10 mM Tris (pH 8.0), 1 mM EDTA, IX 

10 Denhardt's Solution, and 10 mM dithiothreitol. Microscope slides containing 

sequential brain cryosections may be independently exposed to 45 |il of hybridization 
solution per slide and silanized cover slips may be placed over the sections being 
exposed to hybridization solution. Sections are incubated overnight (15-18 hours) at 
52°C to allow hybridization to occur. Equivalent series of cryosections are then 

15 exposed to sense or antisense NgR-specific cRNA riboprobes. 

Following the hybridization period, coverslips are washed off the slides in IX 
SSC, followed by RNase A treatment involving the exposure of slides to 20 jig/ml 
KNase A in a buffer containing 10 mM Tris-HCl (pH 7.4), 0.5 M EDTA, and 0.5 M 
NaCl for 45 minutes at 37°C. The cryosections are then subjected to three 

20 high-stringency washes in 0. 1 X SSC at 52°C for 20 minutes each. Following the 

series of washes, cryosections are dehydrated by consecutive exposure to 70%, 95%, 
and 100% ammonium acetate in alcohol, followed by air drying and exposure to 
Kodak BioMax™ MR-1 film. After 13 days of exposure, the film is developed, and 
any significant hybridization signal is detected. Based on these results, slides containing 

25 tissue that hybridized, as shown by film autoradiograms, are coated with Kodak NTB- 
2 nuclear track emulsion and the slides are stored in the dark for 32 days. The slides 
are then developed and counterstained with hematoxylin. Emulsion-coated sections 
are analyzed microscopically to determine the specificity of labeling. The signal is 
determined to be specific if autoradiographic grains (generated by antisense probe 

30 hybridization) are clearly associated with cresyl violate-stained cell bodies. 

Autoradiographic grains found between cell bodies indicate non-specific binding of the 
probe. 
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In some cases, such as using a probe to detect a NgR homolog in a 
heterologous species, in order to achieve optimal hybridization, it may be necessary to 



activity have utility for treating neurological disorders. Some other diseases for which 
modulators of NgR may have utility include depression, anxiety, bipolar disease, 
epilepsy, neuritis, neurasthenia, neuropathy, neuroses, and the like. Use of NgR 
modulators, including NgR ligands and anti-NgR antibodies, to treat individuals having 
10 such disease states is intended as an aspect of the invention. 

Example 10: Northern Blot Analysis of NgR-RNA with a PCR-generated Probe 

Northern blot hybridizations may be performed to examine the expression of 
NgR mRNA. A clone containing at least a portion of the sequence of SEQ ID NO: 1 
15 may be used as a probe. Vector-specific primers are used in PCR to generate a 
hybridization probe fragment for 32 P-labeling. The PCR is performed as follows: 



5 



decrease the stringency conditions. Such conditions are well known to those of 
ordinary skill in the art and examples are provided above. 

Expression of NgR in the brain provides an indication that modulators of NgR 



Mix: l\xl 



NgR-containing plasmid 
fwd primer (10-50 pM) 
rev primer (10-50 pM) 

lOxPCR buffer (such as that provided with the enzyme, 
Amersham Pharmacia Biotech) 

lOmM dNTP (such as #1 969 064 from Boehringer Mannheim) 
Taq polymerase (such as #27-0799-62, Amersham Pharmacia 
Biotech) 



20 



2\il 
2\il 



0.5[il 



25 



83.5jxl water 
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PCR is performed in a Thermocycler using the following program: 



94°C 



5min 



5 



94°C 
55°C 
72°C 



lmin 
lmin 
lmin 



30 cycles 



1TC 



lOmin 



10 



The PCR product may be purified using QIAquick PCR Purification Kit 
" (#28104) from Qiagen, and radictively labeled with 32 P-dCTP (#AA0005/250, 
Amersham Pharmacia Biotech)) may be done by random priming using "Ready-to-go 
DNA Labeling Beads" (#27-9240-01) from Amersham Pharmacia Biotech. 

1 5 Hybridization is carried out on Human Multiple Tissue Northern Blot from Clontech as 
described in manufacturer's protocol, or on a Northern Blot prepared by running RNA 
samples from cells of interest on an agarose gel and blotting to a membrane using any 
of the known Northern blotting protocols. After exposure overnight on Molecular 
Dynamics Phosphor Imager screen (#MD146-814) bands of an appropriate size are 

20 visualized. 

Example 11: Recombinant Expression of NgR in Eukaryotic Host Cells 



25 suitable host cell using a suitable expression vector and standard genetic engineering 
techniques. For example, a NgR-encoding sequence described in Table 4 is subcloned 
into the commercial expression vector pzeoS V2 (Invitrogen, San Diego, CA) and 
transfected into Chinese Hamster Ovary (CHO) cells using the transfection reagent 
FuGENE6™ (Boehringer-Mannheim) and the transfection protocol provided in the 

30 product insert. Other eukaryotic cell lines, including human embryonic kidney (HEK 
293) and COS cells, are suitable as well. Cells stably expressing NgR are selected by 
growth in the presence of 100 p,g/ml zeocin (Stratagene, LaJolla, CA). As an 
alternative to FuGENE6™, the expression vector may carry the gene for dihydrofolate 
reductase (dhfr) and selection of clones with methotrexate (MTX) drug pressure 



A, Expression of NgR in Mammalian Cells 

To produce NgR protein, a NgR-encoding polynucleotide is expressed in a 
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allows for stable transformation of CHO cells. Optionally, NgR may be purified from 
the cells using standard chromatographic techniques. To facilitate purification, antisera 
is raised against one or more synthetic peptide sequences that correspond to portions 
of the NgR amino acid sequence, and the antisera is used to affinity purify Nogo-R. 
5 The NgR also may be expressed in-frame with a tag sequence (e.g., polyhistidine, 
hemaglutinin, FLAG) to facilitate purification. Moreover, it will be appreciated that 
many of the uses for NgR polypeptides, such as assays described below, do not require 
purification of NgR from the host cell. 

10 B. Expression of NgR in CHO cells 

For expression of NgR in Chinese hamster ovary (CHO) cells, a plasmid 
bearing the relevant NgR coding sequence is prepared, using a vector which also bears 
the selectable marker dihydrofolate reductase (DHFR). The plasmid is transfected into 
CHO cells. Selection under MTX drug pressure allows for preparation of stable 
15 transformants of a NgR (NgR2 or NgR3) in an expression plasmid carrying a 
selectable marker such as DHFR. 

C Expression of NgR in 293 cells 

For expression of NgR in mammalian cells 293 (transformed human, primary 
20 embryonic kidney cells), a plasmid bearing the relevant NgR coding sequence is 
prepared, using vector pSecTag2A (Invitrogen). Vector pSecTag2A contains the 
murine IgK chain leader sequence for secretion, the c-myc epitope for detection of the 
recombinant protein with the anti-myc antibody, a C-terrninal polyhistidine for 
purification with nickel chelate chromatography, and a Zeocin resistant gene for 
25 selection of stable transfectants. The forward primer for amplification of this NgR 
cDNA is determined by routine procedures and preferably contains a 5' extension of 
nucleotides to introduce the Hindm cloning site and nucleotides matching the NgR 
sequence. The reverse primer is also determined by routine procedures and preferably 
contains a 5' extension of nucleotides to introduce an Xhol restriction site for cloning 
30 and nucleotides corresponding to the reverse complement of the NgR sequence. The 
PCR conditions are 55°C as the annealing temperature. The PCR product is gel 
purified and cloned into the HindHirXhol sites of the vector. 
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The DNA is purified using Qiagen chromatography columns and transfected 
into 293 cells using DOTAP™ transfection media (Boehringer Mannheim, 
Indianapolis, IN). Transiently transfected cells are tested for expression after 24 hours 
of transfection, using western blots probed with anti-His and anti-NgR peptide 
5 antibodies. Permanently transfected cells are selected with Zeocin and propagated. 
Production of the recombinant protein is detected from both cells and media by 
Western blots probed with anti-His, anti-Myc or anti-NgR peptide antibodies. 

D, Transient Expression of Nogo-R in COS cells 

10 For expression of the NgR in COS7 cells, a polynucleotide molecule having a 

nucleotide sequence of SEQ ID NO: 1, for example, can be cloned into vector p3-CI. 
This vector is a pUC18-derived plasmid that contains the HCMV (human 
cytomegalovirus) promoter-intron located upstream from the bGH (bovine growth 
hormone) polyadenylation sequence and a multiple cloning site. 

1 5 The forward primer is determined by routine procedures and preferably 

contains a 5 f extension which introduces an Xbal restriction site for cloning, followed 
by nucleotides which correspond to a nucleotide sequence of SEQ ID NO: 1 . The 
reverse primer is also determined by routine procedures and preferably contains 5- 
extension of nucleotides which introduces a Sail cloning site followed by nucleotides 

20 which correspond to the reverse complement of a nucleotide sequence of SEQ ED 
NO:l. 

The PCR consists of an initial denaturation step of 5 min at 95 °C, 30 cycles of 
30 sec denaturation at 95°C, 30 sec annealing at 58"C and 30 sec extension at 72°C, 
followed by 5 min extension at 72°C. The PCR product is gel purified and ligated into 

25 the Xbal and Sail sites of vector p3-CI. This construct is transformed into E. coli cells 
for amplification and DNA purification. The DNA is purified with Qiagen 
chromatography columns and transfected into COS 7 cells using Lipofectamine™ 
reagent from BRL, following the manufacturer's protocols. Forty-eight and 72 hours 
after transfection, the media and the cells are tested for recombinant protein 

30 expression. 

NgR expressed from a COS cell culture can be purified by concentrating the 
cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for 
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example, chromatography. Purified NgR is concentrated to 0.5 mg/ml in an Amicon 
concentrator fitted with a YM-10 membrane and stored at -80°C. NgR3 may also be 
expressed using this method and the nucleotide sequence of SEQ ID NO:3 or SEQ ID 
NO:13. 

5 

E. Expression of NgR in Insect Cells 

For expression of NgR in a baculovirus system, a polynucleotide molecule 
having a nucleotide sequence of SEQ ID NO: 1, 3 or 13 can be amplified by PCR. The 
forward primer is determined by routine procedures and preferably contains a 5' 

10 extension which adds the Ndel cloning site, followed by nucleotides which correspond 
to a nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID NO: 13, 
respectively). The reverse primer is also determined by routine procedures and 
preferably contains a 5' extension which introduces the Kpnl cloning site, followed by 
nucleotides which correspond to the reverse complement of a nucleotide sequence of 

15 SEQ ID NO:l (or SEQ ID NO:3 or SEQ ID NO:13, respectively). 

The PCR product is gel purified, digested with Ndel and Kpnl, and cloned into 
the corresponding sites of vector pACHTL-A (Pharmingen, San Diego, CA). The 
pAcHTL expression vector contains the strong polyhedrin promoter of the 
Autographa californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag 

20 upstream from the multiple cloning site. A protein kinase site for phosphorylation and 
a thrombin site for excision of the recombinant protein precede the multiple cloning 
site is also present. Of course, many other baculovirus vectors could be used in place 
of pAcHTL-A, such as pAc373, pVL941 and pAcIMl. Other suitable vectors for the 
expression of NgR polypeptides can be used, provided that the vector construct 

25 includes appropriately located signals for transcription, translation, and trafficking, 

such as an in-frame AUG and a signal peptide, as required. Such vectors are described 
in Luckow et al, Virology 170:31-39, among others. 

The virus is grown and isolated using standard baculovirus expression methods, 
such as those described in Summers et al (1987) A MANUAL OF METHODS FOR 

30 Baculovirus Vectors and Insect Cell Culture Procedures, Texas Agricultural 
Experimental Station Bulletin No. 1555. 
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In a preferred embodiment, p AcHLT-A containing NgR gene is introduced into 
baculovirus using the "BaculoGold™" transfection kit (Pharmingen, San Diego, CA) 
using methods established by the manufacturer. Individual virus isolates are analyzed 
for protein production by radiolabeling infected cells with 35 S-methionine at 24 hours 
5 post infection. Infected cells are harvested at 48 hours post infection, and the labeled 
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can 
be isolated and used for scaled up expression. 

For expression of a NgR polypeptide in a Sf9 cells, a polynucleotide molecule 
having the nucleotide sequence of SEQ ID NO: 1 (or SEQ ID NO:3 or SEQ ID 

10 NO: 13) can be amplified by PCR using the primers and methods described above for 
baculovirus expression. The NgR cDNA is cloned into vector pAcHLT-A 
(Pharmingen) for expression in Sf9 insect. The insert is cloned into the Ndel and Kpnl 
sites, after elimination of an internal Ndel site (using the same primers described above 
for expression in baculovirus). DNA is purified with Qiagen chromatography columns 

1 5 and expressed in Sf9 cells. Preliminary Western blot experiments from non-purified 
plaques are tested for the presence of the recombinant protein of the expected size 
which reacted with the NgR-specific antibody. These results are confirmed after 
further purification and expression optimization in EBG5 cells. 

20 F. Expression of soluble forms of NgR2 and NgR3 as NgR-Ig fusion 

proteins. 

To generate aNgR2-Ig fusion protein, standard methods may be used as 
described in the literature (e.g. Sanicola et al (1997) Proc. Natl. Acad. Sci. USA. 94, 
6238-6243). For example, a DNA fragment encoding NgR2 without the sequence 

25 encoding the hydrophobic C-terminus (GPI anchor signal) may be ligated to a DNA 
fragment encoding the Fc domain of IgGl (which may be human IgGl), and the 
chimeric fragment may be cloned into an expression vector to generate a plasmid. The 
plasmid may then be transfected into Chinese hamster ovary cells to generate a stable 
cell line producing the fusion protein. The fusion protein is then purified from 

30 conditioned media using standard methods. For example, clarified conditioned media 
from the cell line may be loaded by gravity directly onto Protein A Sepharose. The 
column may then be washed with five column volumes each of PBS, PBS containing 
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0.5 MNaCl, and 25 mM sodium phosphate, 100 mMNaCl (pH 5.0). The bound 
protein may then be eluted with 25 mM NaH 2 P0 4 , 100 mM NaCl (pH 2.8) and 
immediately neutralized with 1/10 fraction volume of 0.5 M Na2HP0 4 (pH 8.6). 
Similar methods may be used to generate a NgR3-Ig fusion protein. 

5 

Example 12: Interaction Trap/Two-Hybrid System 

In order to assay for NgR-interacting proteins, the interaction trap/two-hybrid 
library screening method can be used. This assay was first described in Fields et al 
(1989) Nature 340, 245, which is incorporated herein by reference in its entirety. A 
10 protocol is published in CURRENT PROTOCOLS IN MOLECULAR BIOLOGY 1999, John 
Wiley & Sons, NY and Ausubel, F. M. et al 1992, Short PROTOCOLS IN Molecular 
BIOLOGY, fourth edition, Greene and Wiley-interscience, NY, which is incorporated 
herein by reference in its entirety. Kits are available from Clontech, Palo Alto, CA 
(Matchmaker Two-Hybrid System 3). 
1 5 A fusion of the nucleotide sequences encoding all or partial NgR and the yeast 

transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an 
appropriate plasmid (i.e., pGBKT7) using standard subcloning techniques. Similarly, a 
GAL4 active domain (AD) fusion library is constructed in a second plasmid (i.e., 
pGADT7) from cDNA of potential NgR-binding proteins (for protocols on forming 
20 cDNA libraries, see Sambrook et al. 1989, Molecular Cloning: ALaboratory 
Manual, second edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY), which is incorporated herein by reference in its entirety. The DNA-BD/NgR 
fusion construct is verified by sequencing, and tested for autonomous reporter gene 
activation and cell toxicity, both of which would prevent a successful two-hybrid 
analysis. Similar controls are performed with the AD/library fusion construct to ensure 
expression in host cells and lack of transcriptional activity. Yeast cells are transformed 
(ca. 105 transformants/mg DNA) with both the NgR and library fusion plasmids 
according to standard procedure (Ausubel, etal, 1992, Short Protocols in 
Molecular Biology, fourth edition, Greene and Wiley-interscience, NY, which is 
incorporated herein by reference in its entirety). In vivo binding of DNA-BD/NgR 
with AD/library proteins results in transcription of specific yeast plasmid reporter 
genes (i.e., IacZ f HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient 
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media to screen for expression of reporter genes. Colonies are dually assayed for 
P-galactosidase activity upon growth in Xgal 

(5-bromo-4-chloro-3-indolyl-b-D-galactoside) supplemented media (filter assay for 
b-galactosidase activity is described in Breeden et al, (1985) Cold Spring Harb. Symp. 
5 Quant Biol, 50, 643, which is incorporated herein by reference in its entirety). 

Positive AD-library plasmids are rescued from transformants and reintroduced into the 
original yeast strain as well as other strains containing unrelated DNA-BD fusion 
proteins to confirm specific NgR/library protein interactions. Insert DNA is sequenced 
to verify the presence of an open reading frame fused to GAL4 AD and to determine 
1 0 the identity of the NgR-binding protein. 

Example 13: Antibodies to Nogo-R 

Standard techniques are employed to generate polyclonal or monoclonal 
antibodies to the NgR receptor, and to generate useful antigen-binding fragments 

1 5 thereof or variants thereof, including "humanized" variants. Such protocols can be 
found, for example, in Sambrook etal (1989), above, and Harlow et al (Eds.), 
Antibodies A Laboratory manual; Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY (1988). In one embodiment, recombinant NgR polypeptides (or 
cells or cell membranes containing such polypeptides) are used as antigen to generate 

20 the antibodies. In another embodiment, one or more peptides having amino acid 

sequences corresponding to an immunogenic portion of NgR (e.g., 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19, 20, or more amino acids) are used as antigen. Peptides 
corresponding to extracellular portions of Nogo-R, especially hydrophilic extracellular 
portions, are preferred. The antigen may be mixed with an adjuvant or linked to a 

25 hapten to increase antibody production. 

A. Polyclonal or Monoclonal antibodies 

As one exemplary protocol, recombinant NgR or a synthetic fragment thereof 
is used to immunize a mouse for generation of monoclonal antibodies (or larger 
30 mammal, such as a rabbit, for polyclonal antibodies). To increase antigenicity, 

peptides are conjugated to Keyhole Limpet Hemocyanin (Pierce), according to the 
manufacturer's recommendations. For an initial injection, the antigen is emulsified with 
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Freund's Complete Adjuvant and injected subcutaneously. At intervals of two to three 
weeks, additional aliquots of NgR antigen are emulsified with Freund's Incomplete 
Adjuvant and injected subcutaneously. Prior to the final booster injection, a serum 
sample is taken from the immunized mice and assayed by western blot to confirm the 
5 presence of antibodies that immunoreact with NgR. Serum from the immunized 

animals may be used as polyclonal antisera or used to isolate polyclonal antibodies that 
recognize NgR. Alternatively, the mice are sacrificed and their spleen removed for 
generation of monoclonal antibodies. 

To generate monoclonal antibodies, the spleens are placed in 10 ml serum-free 

10 RPMI 1640, and single cell suspensions are formed by grinding the spleens in serum- 
free RPMI 1640, supplemented with 2 raM L-glutamine, 1 mM sodium pyruvate, 100 
units/ml penicillin, and 100 |ig/ml streptomycin (RPMI) (Gibco, Canada). The cell 
suspensions are filtered and washed by centrifugation and resuspended in serum-free 
RPMI. Thymocytes taken from three naive Balb/c mice are prepared in a similar 

15 manner and used as a Feeder Layer. NS-1 myeloma cells, kept in log phase in RPMI 
with 10% fetal bovine serum (FBS) (Hyclone Laboratories, Inc., Logan, UT) for three 
days prior to fusion, are centrifuged and washed as well. 

To produce hybridoma fusions, spleen cells from the immunized mice are 
combined with NS-1 cells and centrifuged, and the supernatant is aspirated. The cell 

20 pellet is dislodged by tapping the tube, and 2 ml of 37'C PEG 1500 (50% in 75 mM 
HEPES, pH 8.0) (Boehringer-Mannheim) is stirred into the pellet, followed by the 
addition of serum-free RPMI. Thereafter, the cells are centrifuged, resuspended in 
RPMI containing 15% FBS, 100 \sM sodium hypoxanthine, 0.4 \iM aminopterin, 16 
[xM thymidine (HAT) (Gibco), 25 units/ml IL-6 (Boehringer-Mannheim) and 1.5 x 10 6 

25 thymocytes/ml, and plated into 10 Corning flat-bottom 96-well tissue culture plates 
(Corning, Corning, NY). 

On days 2, 4, and 6 after the fusion, 100 pi of medium is removed from the 
wells of the fusion plates and replaced with fresh medium. On day 8, the fusions are 
screened by ELISA, testing for the presence of mouse IgG that binds to NgR. 

30 Selected fusion wells are further cloned by dilution until monoclonal cultures 
producing anti-NgR antibodies are obtained. 
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B. Humanization of anti-NgR monoclonal antibodies 

The expression pattern of NgR as reported herein and the potential of NgRs as 
targets for therapeutic intervention suggest therapeutic indications for NgR inhibitors 
(antagonists). NgR-neutralizing antibodies comprise one class of therapeutics useful as 
5 NgR antagonists. . Following are protocols to improve the utility of anti-NgR 

monoclonal antibodies as therapeutics in humans by "humanizing" the monoclonal 
antibodies to improve their serum half-life and render them less immunogenic in human 
hosts to prevent human antibody response to non-human anti-NgR antibodies). 

The principles of humanization have been described in the literature and are 
10 facilitated by the modular arrangement of antibody proteins. To minimize the 
possibility of binding complement, a humanized antibody of the IgG4 isotype is 
preferred. 

For example, a level of humanization is achieved by generating chimeric 
antibodies comprising the variable domains of non-human antibody proteins of interest 

15 with the constant domains of human antibody molecules. (See, e.g., Morrison et al, 
(1989) Adv. Immunol, 44, 65-92). The variable domains of NgR-neutralizing anti- 
NgR antibodies are cloned from the genomic DNA of a B-cell hybridoma or from 
cDNA generated from mRNA isolated from the hybridoma of interest. The V region 
gene fragments are linked to exons encoding human antibody constant domains, and 

20 the resultant construct is expressed in suitable mammalian host cells (e.g., myeloma or 
CHO cells). 

To achieve an even greater level of humanization, only those portions of the 
variable region gene fragments that encode antigen-binding complementarity 
determining regions ("CDR") of the non-human monoclonal antibody genes are cloned 

25 into human antibody sequences. (See, e.g., Jones et al, (1986) Nature 321, 522-525; 
Riechmann etal, (1988) Nature 332, 323-327; Verhoeyen etal, (1988) Science 239, 
1534-1536 ; and Tempest et al, (1991) Bio/Technology 9, 266-271). If necessary, the 
J3-sheet framework of the human antibody surrounding the CDR3 regions also is 
modified to more closely mirror the three dimensional structure of the antigen-binding 

30 domain of the original monoclonal antibody. (See Kettleborough et al, (1991) Protein 
Engin. 4, 773-783; and Foote et al, (1992) J. Mol Biol 224, 487-499). 
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In an alternative approach, the surface of a non-human monoclonal antibody of 
interest is humanized by altering selected surface residues of the non-human antibody, 
e.g., by site-directed mutagenesis, while retaining all of the interior and contacting 
residues of the non-human antibody. See Padlan (1991) Afo/. Immunol. 28, 489-498 . 
5 The foregoing approaches are employed using NgR-neutralizing anti-NgR 

monoclonal antibodies and the hybridomas that produce them to generate humanized 
NgR-neutralizing antibodies useful as therapeutics to treat or palliate conditions 
wherein NgR expression or ligand-mediated NgR signaling is detrimental. 

10 C. Human NgR-Neutralizing Antibodies from Phage Display 

Human NgR-neutralizing antibodies are generated by phage display techniques 
such as those described in Aujame et al (1997) Human Antibodies 8, 155-168; 
Hoogenboom (1997) TBBTECH 15, 62-70; and Rader et aL (1997), Curr. Opin. 
BiotechnoL 8, 503-508, all of which are incorporated by reference. For example, 

15 antibody variable regions in the form of Fab fragments or linked single chain Fv 

fragments are fused to the amino terminus of filamentous phage minor coat protein 
pin. Expression of the fusion protein and incorporation thereof into the mature phage 
coat results in phage particles that present an antibody on their surface and contain the 
genetic material encoding the antibody. A phage library comprising such constructs is 

20 expressed in bacteria, and the library is screened for NgR-specific phage-antibodies 
using labeled or immobilized NgR as antigen-prbbe. 

D. Human NgR-neutralizing antibodies from transgenic mice 

Human NgR-neutralizing antibodies are generated in transgenic mice essentially 
25 as described in Bruggemann et al (1996) Immunol Today 17, 391-397 and 

Bruggemannefa/. (1997) Curr. Opin. Biotechnol. 8, 455-458. Transgenic mice 
carrying human V-gene segments in germline configuration and that express these 
transgenes in their lymphoid tissue are immunized with a NgR composition using 
conventional immunization protocols, hybridomas are generated using B cells from the 
30 immunized mice using conventional protocols and screened to identify hybridomas 
secreting anti-NgR human antibodies {e.g., as described above). 
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Example 14: Assays to Identify Modulators of NgR Activity 

Set forth below are several nonlimiting assays for identifying modulators 
(agonists and antagonists) of NgR activity. Among the modulators that can be 
identified by these assays are natural ligand compounds of the receptor; synthetic 
5 analogs and derivatives of natural ligands; antibodies, antibody fragments, and/or 
antibody-like compounds derived from natural antibodies or from antibody-like 
combinatorial libraries; and/or synthetic compounds identified by high-throughput 
screening of libraries; and the like. All modulators that bind NgR are useful for 
identifying NgR in tissue samples (e.g., for diagnostic purposes, pathological purposes, 
10 and the like). Agonist and antagonist modulators are useful for up-regulating and 
down-regulating NgR activity, respectively, to treat disease states characterized by 
abnormal levels of NgR activity. The assays may be performed using single putative 
modulators, and/or may be performed using a known agonist in combination with 
candidate antagonists (or visa versa). 

15 

A. cAMP Assays 

In one type of assay, levels of cyclic adenosine monophosphate (cAMP) are 
measured inNgJl-transfected cells that have been exposed to candidate modulator 
compounds. Protocols for cAMP assays have been described in the literature. (See, 

20 e.g., Sutherland et al, (1968) Circulation 37, 279; Frandsen et al, (1976) Life 

Sciences 18, 529-541; Dooley et al, (1997) J. Pharmacol Exp. Therap. 283, 735-41; 
and George et al, (1997) J. Biomol Screening 2, 235-40). An exemplary protocol for 
such an assay, using an Adenylyl Cyclase Activation FlashPlate® Assay from NEN™ 
Life Science Products, is set forth below, 

25 Briefly, the NgR coding sequence (e.g., a cDNA or intronless genomic DNA) 

is subcloned into a commercial expression vector, such as pzeoSV2 (Invitrogen), and 
transiently transfected into Chinese Hamster Ovary (CHO) cells using known methods, 
such as the transfection protocol provided by Boehringer-Mannheim when supplying 
the FuGENE 6 transfection reagent. Transfected CHO cells are seeded into 96-well 

30 microplates from the FlashPlate® assay kit, which are coated with solid scintillant to 
which antisera to cAMP has been bound. For a control, some wells are seeded with 
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wild type (untransfected) CHO cells. Other wells in the plate receive various amounts 
of a cAMP standard solution for use in creating a standard curve. 

One or more test compounds (i.e., candidate modulators) are added to the cells 
in each well, with water and/or compound-free medium/diluent serving as a control or 
5 controls. After treatment, cAMP is allowed to accumulate in the cells for exactly 15 
minutes at room temperature. The assay is terminated by the addition of lysis buffer 
containing [ 125 I]-labeled cAMP, and the plate is counted using a Packard Topcount™ 
96-well microplate scintillation counter. Unlabeled cAMP from the lysed cells (or 
from standards) and fixed amounts of [ I]-cAMP compete for antibody bound to the 

10 plate. A standard curve is constructed, and cAMP values for the unknowns are 

obtained by interpolation. Changes in intracellular cAMP levels of cells in response to 
exposure to a test compound are indicative of NgR modulating activity. Modulators 
that act as agonists of receptors which couple to the G s subtype of G proteins will 
stimulate production of cAMP, leading to a measurable 3-10 fold increase in cAMP 

1 5 levels. Agonists of receptors which couple to the subtype of G proteins will inhibit 
forskolin-stimulated cAMP production, leading to a measurable decrease in cAMP 
levels of 50-100%. Modulators that act as inverse agonists will reverse these effects at 
receptors that are either constitutive!/ active or activated by known agonists. 

20 B. Aequorin Assays 

In another assay, cells (e.g., CHO cells) are transiently co-transfected with both 
a NgR expression construct and a construct that encodes the photoprotein apoaquorin. 
In the presence of the cofactor coelenterazine, apoaquorin will emit a measurable 
luminescence that is proportional to the amount of intracellular (cytoplasmic) free 

25 calcium. (See generally, Cobbold, et ah "Aequorin measurements of cytoplasmic free 
calcium," In: McCormack J.G. and Cobbold P.H., eds., CELLULAR CALCIUM: A 
Practical Approach. Oxford:IRL Press (1991); Stables et aL 9 (1997) Anal 
Biochem. 252, 115-26; andHaugland, Handbook OF FLUORESCENT Probes AND 
Research Chemicals. Sixth edition. Molecular Probes, Eugene, OR (1996)). 

30 In one exemplary assay, NgR is subcloned into the commercial expression 

vector pzeoSV2 (Invitrogen) and transiently co-transfected along with a construct that 
encodes the photoprotein apoaquorin (Molecular Probes, Eugene, OR) into CHO cells 
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using the transfection reagent FuGENE 6 (Boehringer-Mannheim) and the transfection 
protocol provided in the product insert. 

The cells are cultured for 24 hours at 37°C in MEM (Gibco/BRL, 
Gaithersburg, MD) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
5 U/ml penicillin and 10 [ig/ml streptomycin, at which time the medium is changed to 
serum-free MEM containing 5 \iM coelenterazine (Molecular Probes, Eugene, OR). 
Culturing is then continued for two additional hours at 37°C. Subsequently, cells are 
detached from the plate using VERSEN (Gibco/BRL), washed, and resuspended at 
200,000 cells/ml in serum-free MEM. 

10 Dilutions of candidate NgR modulator compounds are prepared in serum-free 

MEM and dispensed into wells of an opaque 96-well assay plate at 50 jil/well. Plates 
are then loaded onto an MLX microtiter plate luminometer (Dynex Technologies, Inc., 
Chantilly, VA). The instrument is programmed to dispense 50 jil cell suspensions into 
each well, one well at a time, and immediately read luminescence for 15 seconds. 

15 Dose-response curves for the candidate modulators are constructed using the area 
under the curve for each light signal peak. Data are analyzed with SlideWrite, using 
the equation for a one-site ligand, and EC 50 values are obtained. Changes in 
luminescence caused by the compounds are considered indicative of modulatory 
activity. Modulators that act as agonists at receptors which couple to the G q subtype 

20 of G proteins give an increase in luminescence of up to 100 fold. Modulators that act 
as inverse agonists will reverse this effect at receptors that are either constitutively 
active or activated by known agonists. 

C. Luciferase Reporter Gene Assay 

25 The photoprotein luciferase provides another useful tool for assaying for 

modulators of NgR activity. Cells (e.g., CHO cells or COS 7 cells) are transiently co- 
transfected with both a NgR expression construct (e.g., NgR in pzeoSV2) and a 
reporter construct which includes a gene for the luciferase protein downstream from a 
transcription factor binding site, such as the cAMP-response element (CRE), AP-1, or 

30 NF-kappa B. Expression levels of luciferase reflect the activation status of the 

signaling events. (See generally, George etal. (1997) J. Biomol Screening!, 235- 
240; and Stratowa etal (1995) Curr. Opin. BiotechnoL 6, 574-581). Luciferase 
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activity may be quantitatively measured using, e.g., luciferase assay reagents that are 
commercially available from Promega (Madison, WI). 

In one exemplary assay, CHO cells are plated in 24-well culture dishes at a 
density of 100,000 cells/well one day prior to transfection and cultured at 37°C in 
5 MEM (Gibco/BRL) supplemented with 10% fetal bovine serum, 2 mM glutamine, 10 
U/ml penicillin and 10 |xg/ml streptomycin. Cells are transiently co-transfected with 
both a NgR expression construct and a reporter construct containing the luciferase 
gene. The reporter plasmids CRE-luciferase, AP-1 -luciferase and NF-kappaB- 
luciferase may be purchased from Stratagene (Legally, CA). Transfections are 

10 performed using the FuGENE 6 transfection reagent (Boehringer-Mannheim) 

according to the supplier's instructions. Cells transfected with the reporter construct 
alone are used as a control. Twenty-four hours after transfection, cells are washed 
once with PBS pre-warmed to 37°C. Serum-free MEM is then added to the cells 
either alone (control) or with one or more candidate modulators and the cells are 

1 5 incubated at 37°C for five hours. Thereafter, cells are washed once with ice-cold PBS 
and lysed by the addition of 100 nl of lysis buffer per well from the luciferase assay kit 
supplied by Promega. After incubation for 15 minutes at room temperature, 15 ^1 of 
the lysate is mixed with 50 jil of substrate solution (Promega) in an opaque-white, 
96-well plate, and the luminescence is read immediately on a Wallace model 1450 

20 MicroBeta scintillation and luminescence counter (Wallace Instruments, Gaithersburg, 
MD). 

Differences in luminescence in the presence versus the absence of a candidate 
modulator compound are indicative of modulatory activity. Receptors that are either 
constitutively active or activated by agonists typically give a 3-20-fold stimulation of 
25 luminescence compared to cells transfected with the reporter gene alone. Modulators 
that act as inverse agonists will reverse this effect. 



D. Intracellular calcium measurement using FLEPR 

Changes in intracellular calcium levels are another recognized indicator of 
30 receptor activity, and such assays can be employed to screen for modulators of NgR 
activity. For example, CHO cells stably transfected with a NgR expression vector are 
plated at a density of 4 x 10 4 cells/well in Packard black-walled, 96-well plates 
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specially designed to discriminate fluorescence signals emanating from the various 
wells on the plate. The cells are incubated for 60 minutes at 37°C in modified 
Dulbecco's PBS (D-PBS) containing 36 mg/L pyruvate and 1 g/L glucose with the 
addition of 1% fetal bovine serum and one of four calcium indicator dyes (Fluo-3™ 
5 AM, Fluo-4™ AM, Calcium Green™- 1 AM, or Oregon Green™ 488 BAPTA-1 AM), 
each at a concentration of 4 pM. Plates are washed once with modified D-PBS 
without 1% fetal bovine serum and incubated for 10 minutes at 37°C to remove 
residual dye from the cellular membrane. In addition, a series of washes with modified 
D-PBS without 1% fetal bovine serum is performed immediately prior to activation of 

10 the calcium response. 

A calcium response is initiated by the addition of one or more candidate 
receptor agonist compounds, calcium ionophore A23187 (10 jjlM; positive control), or 
ATP (4 [iM; positive control). Fluorescence is measured by Molecular Device's 
FLIPR with an argon laser (excitation at 488 nm). (See, e.g., Kuntzweiler et al 

15 (1998) Drug Dev. Res. 44,14-20). The F-stop for the detector camera is set at 2.5 and 
the length of exposure is 0.4 milliseconds. Basal fluorescence of cells is measured for 
20 seconds prior to addition of candidate agonist, ATP, or A23 187, and the basal 
fluorescence level is subtracted from the response signal. The calcium signal is 
measured for approximately 200 seconds, taking readings every two seconds. Calcium 

20 ionophore A23 1 87 and ATP increase the calcium signal 200% above baseline levels. 
In general, activated NgRs increase the calcium signal at least about 10-15% above 
baseline signal. 

E. [ 35 S]GTPyS Binding Assay 

25 It is also possible to evaluate whether NgR signals through a G 

protein-mediated pathway. Because G protein-coupled receptors signal through 
intracellular G proteins whose activity involves GTP binding and hydrolysis to yield 
bound GDP, measurement of binding of the non-hydrolyzable GTP analog [ 35 S]- 
GTPyS in the presence and absence of candidate modulators provides another assay 

30 for modulator activity. (See, e.g., Kowal et al, (1998) Neuropharmacology 37, 179- 
187.). 
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In one exemplary assay, cells stably transfected with a NgR expression vector 
are grown in 10 cm tissue culture dishes to subconfluence, rinsed once with 5 ml of 
ice-cold Ca 2+ /Mg 2+ -free phosphate-buffered saline, and scraped into 5 ml of the same 
buffer. Cells are pelleted by centrifugation (500 xg, 5 minutes), resuspended in TEE 
5 buffer (25 mM Tris, pH 7.5 , 5 mM EDTA, 5 mM EGTA), and frozen in liquid 

nitrogen. After thawing, the cells are homogenized using a Dounce homogenizer (1 ml 
TEE per plate of cells), and centrifuged at 1,000 x g for 5 minutes to remove nuclei 
and unbroken cells. 

The homogenate supernatant is centrifuged at 20,000 x g for 20 minutes to 

10 isolate the membrane fraction, and the membrane pellet is washed once with TEE and 
resuspended in binding buffer (20 mM HEPES, pH 7.5, 150 mM NaCl, 10 mM MgCl 2 , 
1 mM EDTA). The resuspended membranes can be frozen in liquid nitrogen and 
stored at -70°C until use. 

Aliquots of cell membranes prepared as described above and stored at -70°C 

15 are thawed, homogenized, and diluted into buffer containing 20 mM HEPES, 10 mM 
MgCl 2 , 1 mM EDTA, 120 mMNaCl, 10 \M GDP, and 0.2 mM ascorbate, at a 
concentration of 10-50 jig/ml. In a final volume of 90 p], homogenates are incubated 
with varying concentrations of candidate modulator compounds or 100 jjM GTP for 
30 minutes at 30"C and then placed on ice. To each sample, 10 \i\ guanosine 

20 5'-0-(3[ 35 S]thio) triphosphate (NEN, 1200 Ci/mmol; [ 35 S]-GTPyS), was added to a 
final concentration of 100-200 pM. Samples are incubated at 30°C for an additional 30 
minutes, 1 ml of 10 mM HEPES, pH 7.4, 10 mM MgCl 2 , at 4'C is added and the 
reaction is stopped by filtration. 

Samples are filtered over Whatman GF/B filters and the filters are washed with 

25 20 ml ice-cold 10 mM HEPES, pH 7.4, 10 mM MgCl 2 . Filters are counted by liquid 
scintillation spectroscopy. Nonspecific binding of [ 35 S]-GTPyS is measured in the 
presence of 100 \iM GTP and subtracted from the total. Compounds are selected that 
modulate the amount of [ 35 S]-GTPyS binding in the cells, compared to untransfected 
control cells. Activation of receptors by agonists gives up to a five-fold increase in 

30 [ 35 S]-GTPyS binding. This response is blocked by antagonists. 



F. [ 3 H] Arachidonic Acid Release 



wo 
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The activation of NgRs may also potentiate arachidonic acid release in cells, 
providing yet another useful assay for modulators of NgR activity. (See, e.g., 
Kanterman etal, (1991) Mol. Pharmacol 39,364-369.) For example, CHO cells that 
are stably transfected with a NgR expression vector are plated in 24-well plates at a 
5 density of 15,000 cells/well and grown in MEM medium supplemented with 10% fetal 
bovine serum, 2 mM glutamine, 10 U/ml penicillin and 10 tig/ml streptomycin for 48 
hours at 37°C before use. Cells of each well are labeled by incubation with 
[^-arachidonic acid (Amersham Corp., 210 Ci/mmol) at 0.5 |iCi/ml in 1 ml MEM 
supplemented with 10 mM HEPES, pH 7.5, and 0.5% fatty-acid-free bovine serum 
10 albumin for 2 hours at 37'C. The cells are then washed twice with 1 ml of the same 
buffer. 

Candidate modulator compounds are added in 1 ml of the same buffer, either 
alone or with 10 jiM ATP and the cells are incubated at 37°C for 30 minutes. Buffer 
alone and mock-transfected cells are used as controls. Samples (0.5 ml) from each 
15 well are counted by liquid scintillation spectroscopy. Agonists which activate the 
receptor will lead to potentiation of the ATP-stimulated release of [ 3 H]-arachidonic 
acid. This potentiation is blocked by antagonists. 

G. Extracellular Acidification Rate 

20 In yet another assay, the effects of candidate modulators of NgR activity are 

assayed by monitoring extracellular changes in pH induced by the test compounds (see, 
e.g., Dunlop et al (1998) J. Pharmacol. Toxicol Meth. 40, 47-55). In one 
embodiment, CHO cells transfected with a NgR expression vector are seeded into 12 
mm capsule cups (Molecular Devices Corp.) at 4 x 10 5 cells/cup in MEM 

25 supplemented with 10% fetal bovine serum, 2 mM L-glutamine, 10 U/ml penicillin, and 
10 jig/ml streptomycin. The cells are incubated in this medium at 37'C in 5% C0 2 for 
24 hours. 

Extracellular acidification rates are measured using a Cytosensor 
microphysiometer (Molecular Devices Corp.). The capsule cups are loaded into the 
30 sensor chambers of the microphysiometer and the chambers are perfused with running 
buffer (bicarbonate-free MEM supplemented with 4 mM L-glutamine, 10 units/ml 
penicillin, 10 jig/ml streptomycin, 26 mMNaCl) at a flow rate of 100 jil/minute. 
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Candidate agonists or other agents are diluted into the running buffer and perfused 
through a second fluid path. During each 60-second pump cycle, the pump is run for 
38 seconds and is off for the remaining 22 seconds. The pH of the running buffer in 
the sensor chamber is recorded during the cycle from 43-58 seconds, and the pump is 
5 re-started at 60 seconds to start the next cycle. The rate of acidification of the running 
buffer during the recording time is calculated by the Cytosoft program. Changes in the 
rate of acidification are calculated by subtracting the baseline value (the average of 4 
rate measurements immediately before addition of a modulator candidate) from the 
highest rate measurement obtained after addition of a modulator candidate. The 
10 selected instrument detects 61 mV/pH unit. Modulators that act as agonists of the 
receptor result in an increase in the rate of extracellular acidification compared to the 
rate in the absence of agonist. This response is blocked by modulators which act as 
antagonists of the receptor. 

15 Example 15: mNgR3 does not bind hNogo-A(1055-l 120) 

To functionally test the mouse NgR3 (hereinafter, mNgR3) for its ability to 
bind hNogo-A(l 055-1 120), a cDNA expression vector for a myc epitope-tagged 
mNgR3 protein was created. The mouse NgR3 cDNA was amplified by PCR from 
mouse adult brain cDNA, from the signal sequence to the stop codon, and ligated into 

20 the pSecTag2 vector such that the vector encodes a signal sequence followed by a myc 
tag followed by the mature mNgR3 sequence. This plasmid was transfected into 
COS07cells, and expression of a myc-tagged protein of the predicted size was verified 
by immunoblot analysis. Alkaline phosphatase-hNogo-A(1055-1120) binding studies 
and myc immunohistology were conducted as described (Fournier et al, supra). 

25 The cells expressing mNgR3 express the myc-tagged protein but binding to 

AP-hNogo-A(1055-1120) was not observed under the conditions employed (Fig. 8). 

Example 16: Identification of partial human NgR3 cDNA and protein sequences 
The tblastn program was used to search for the human homolog of mouse 
30 NgR3 . The mouse NgR3 protein sequence (SEQ ID NO:4) was used to query a 
proprietary human expressed sequence tag (EST) database from Incyte yielding one 
highly significant hit: Incyte Template ID 190989.1. This sequence (937 nucleotides) 
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contains an open reading frame of 3 12 amino acids in the second reverse frame that 
exhibits 88% identity with residues 66 to 381 of mouse NgR3 (SEQ ID NO:4), 
strongly indicating that it is part of the human NgR3 homolog. 

A query of SEQ ID NO:4 against the public human EST database in Genbank 
5 also produced a hit with a 465-bp EST (Accession number: R35699; Version number: 
R3 5699.1; GI: 792600). There are a number of single nucleotide deletions and 
insertions within this sequence which cause frame shift errors. All of the reliable 
sequence contained in this public EST is present in the Incyte EST (Template ID 
190989.1). 

10 To obtain more nucleotide sequence that would extend the amino acid 

sequence at that carboxy terminal end, the I.M.A.G.E. Consortium clone No. 38319, 
which corresponds to Genbank accession No. R35699, was purchased from Incyte 
Genomics Inc. and subjected to further DNA sequence analysis. This clone consists of 
a Notl/HinD m fragment containing the sequence of interest, cloned into the 

1 5 Notl/HinD m sites of the vector Lafinid BA 

(http://image.M.govAimage/html/Ubs/lafiiudBAsh The clone was received as an 
agar stab, which was streaked out on LB agar plates containing 50ug/ml ampicillin to 
isolate individual colonies. Six colonies were grown in LB medium with antibiotic, and 
plasmid DNA was prepared using the Promega Wizard Plus Miniprep DNA 

20 Purification System (Promega #A7500). These DNAs were subsequently digested 
withNotl and HinD III restriction enzymes to confirm that the clones contained an 
insert. The insert of one isolate was sequenced using a combination of vector specific 
and gene specific primers yielding a partial nucleotide sequence of human NgR3 of 
1 176 nucleotides (SEQ ID NO: 13). A translation of this sequence provides a partial 

25 sequence for human NgR3 of 392 amino acids (SEQ ID NO: 14). 

The nucleotide sequence of SEQ ID NO: 13 differs from the Incyte EST 
sequence at three positions. Nucleotide positions 12-13 in SEQ ID NO: 13 are CG, 
whereas the corresponding nucleotides in the Incyte Template ID 190989.1 are GT 
(i.e., positions 12-13 of the complement of Incyte Template ID 190989.1). In 

30 addition, position 641 in SEQ ID NO: 13 is a C, whereas the corresponding nucleotide 
in the Incyte Template ID 190989. 1 sequence is an A (i.e., position 641 of the 
complement of Incyte Template ID 190989.1). This results in two changes in amino 
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acids when comparing SEQ ID NO: 14 to the ORF encoded by Incyte Template 
190989. 1 : SEQ ID NO: 14 contains a valine at position 5, whereas the ORF encoded 
by Incyte Template ID 190989.1 contains a leucine; SEQ ID NO: 14 contains an 
alanine at position 214, whereas the ORF encoded by Incyte Template ID 190989.1 
5 contains a glutamic acid. 

The nucleotide sequence of SEQ ID NO: 13 differs from the public EST 
(Accession number: R35699; Version number: R35699. 1; GI: 792600) sequence at 
two positions (within the first 200 nucleotides of reliable sequence). Nucleotide 
positions 12-13 in SEQ ID NO: 13 are CG, whereas the corresponding nucleotides in 
10 the public EST are GT (i.e., positions 12-13 of the public EST; Accession no: R35699; 
Version no: R35699. 1; GI: 792600) This leads to a single amino acid change when 
comparing SEQ ID NO: 14 to the ORF encoded by the public EST: SEQ ID NO: 14 
contains a valine at position 5, while the ORF encoded by the public EST contains a 
leucine. 

15 A Bestfit analysis of the partial human amino acid sequence with the full-length 

mouse amino acid sequence indicates that the human NgR3 amino acid sequence is 
complete at the carboxy terminal end and that they share 89.54% identity. An 
alignment of all the NgR proteins is shown in Figure 9. Although the human NgR3 
amino acid sequence is missing the first 25 amino acids, it can be determined that the 

20 human NgR3 protein contains the following features in common with the other NgR 
sequences: (1) eight Leucine Rich Repeat (LRR) domains; (2) an LRR carboxy- 
terminal (LRR-CT) domain; (3) a conserved cysteine in the fourth LRR domain; (4) a 
conserved potential glycosylation site in the eighth LRR domain; and (5) a 
hydrophobic carboxyl terminus. 1 

25 As those skilled in the art will appreciate, numerous changes and modifications 

may be made to the preferred embodiments of the invention without departing from the 
spirit of the invention. It is intended that all such variations fall within the scope of the 
invention. 

The entire disclosure of each publication cited herein is hereby incorporated by 
30 reference. This application claims benefit from United States provisional application 
60/238,361, filed October 6, 2000, which is incorporated by reference herein in its 
entirety. 
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Key for Sequence Listing: 





SEQIDNO:l 


human NgR2 cDNA sequence denved from genomic sequence 
AC013606 




SEQDDNO:2 


human NgR2 amino acid sequence 


5 


SEQIDNO:3 


mouse NgR3 cDNA sequence derived from AC021768 




SEQIDNO:4 


a mouse NgR3 amino acid sequence 




SEQ ID N0:5 


a human NgRl amino acid sequence 




SEQIDNO:6 


a consensus amino acid sequence for NgRs 




SEQ ID NO:7 


#1055-1 120 amino acid residues of HNogoA (Nogo-66) 


10 


SEQIDNO:8 


a mature human NgR2 amino acid sequence 




SEQ ID NO:9 


a mature mouse NgR3 amino acid sequence 




SEQ ID NO: 10 


a consensus NgR LLRNT amino acid sequence 




SEQIDNO:ll 


a consensus NgR LRRCT domain amino acid sequence 




SEQ ID NO : 12 


a consensus NgR LRR domain amino acid sequence 


15 


SEQ ID NO: 13 


a partial human NgR3 nucleotide sequence 




SEQIDNO:14 


a partial human NgR3 amino acid sequence 




SEQIDNO:15 


a genomic sequence encoding a human NgR2 sequence. 




SEQ ID NO: 16 


a genomic sequence (complementary strand) encoding a mouse 
NgR3 


20 


SEQ ID NO: 17 


a mouse NgRl amino acid sequence 




SEQ ID NO: 18 


a consensus sequence for the NTLRRCT domain of NgR 




SEQIDNO:19 


an consensus NgR LRRCT domain amino acid sequence 



25 



30 
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CLAIMS 

What is claimed is: 

5 

1 . An isolated nucleic acid comprising a nucleotide sequence encoding 
a polypeptide comprising an LRRCT domain consisting of the amino acid sequence: 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX < X 7 XgX 9 RX 10 SSS X u V 

10 

X 12 C X 13 X 14 P X 15 Xj^ X 17 X 18 X i9 X2Q D L Xj! X22 L X23 X 24 X M D 

X 26 X 27 X 28 C[SEQIDNO: 19] 

15 wherein X is any amino acid or a gap and the polypeptide does not comprise 

the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 (human NgRl) or 
SEQ ID NO: 17 (mouse NgRl). 

2. The isolated nucleic acid according to claim 1, wherein X 17 and X^ 
20 are independently selected from the group consisting of: arginine and lysine. 

3. The isolated nucleic acid according to claim 2, wherein the amino 
acid sequence of the LRRCT domain is selected from the group consisting of: residues 
#261-3 10 of SEQ ID NO:2 and residues 261-3 10 of SEQ ID NO: 2 with up to 10 

25 conservative amino acid substitutions. 

4. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 2. 

5. An isolated nucleic acid encoding the polypeptide of SEQ ID NO: 4 
30 (mouse NgR3) or SEQ ID NO: 14 (human NgR3). 



6. The isolated nucleic acid according to claim 1, wherein the 
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polypeptide comprises: (a) a NTLRRCT domain, and (b) less than a complete CTS 
domain, provided that a partial CTS domain, if present, consists of no more than the 
first 39 amino acids of the CTS domain. 

5 7. The isolated nucleic acid to claim 1, wherein the polypeptide does 

not comprise an intact GPI domain. 

8. An isolated nucleic acid consisting essentially of a nucleotide 
sequence complementary to a nucleotide sequence encoding a polypeptide selected 
10 from the group consisting of: a polypeptide consisting of residues 3 1 1-395 of SEQ ID 
NO: 2, a polypeptide consisting of residues 256-396 of SEQ ID NO: 14 and a 
polypeptide consisting of residues 321-438 of SEQ ID NO: 4, wherein the nucleic acid 
is from 8 to 100 nucleotides in length. 

15 9. A vector comprising the nucleic acid of any one of claims 1, 4 or 5. 

. 10. A host cell comprising a vector according to claim 9. 

1 1. A polypeptide comprising a LRRCT amino acid sequence: 

20 

NX 1 WX 2 CX 3 CRARX 4 LWX 5 WX 6 X 7 X 8 X 9 RX 10 SSSX 11 V 

X 12 C X 13 X 14 P X ls X 16 X 17 X 18 X w X 20 D L X21 X22 L X23 X 24 X^ D 

25 X 26 Xv C [SEQ ID NO: 19] 

wherein X is any amino acid residue or a gap and the polypeptide does not 
comprise the amino acid sequence from residue 260 to 309 of SEQ ID NO: 5 
(human NgRl) or SEQ ID NO: 17 (mouse NgRl). 

30 

12. The polypeptide according to claim 1 1, wherein X 17 and X a is 
selected from the group consisting of arginine and lysine. 
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13. The polypeptide according to claim 1 1, wherein X 19 is glycine. 
[SEQIDNO:ll] 

14. The polypeptide according to claim 1 1, wherein the amino acid 

5 sequence is selected from the group consisting of residues 261-3 10 of SEQ ID NO:2, 
residues 206-255 of SEQ ID NO: 14, residues 271-320 of SEQ ID NO.4 and amino 
acid sequences thereof comprising a conservative substitution. 



15. A polypeptide comprising a NTLRRCT amino acid sequence: 



10 



C P Xj X 2 C X 3 C Y X4 X5 P Xg X 7 T X 8 S C X9 X 10 X u X 12 X 13 X 14 X 15 X 16 P 

^17 ^18 -^19 P ^20 ^21 ^22 ^23 ^ ^24 F L X 25 X^ N X 27 I X 28 X^ X 30 X 31 X 32 X 33 

X 34 F X 35 X 36 X 37 X 38 X 39 X^ X 41 X 42 L W X^ X^ S N X^ X 46 X^ 7 X 48 1 X 49 
X50 Xj! X^ F Xj 3 X^ X 55 X5 6 Xj 7 L E X53 L D L X^ D N X 60 X^ L X^ X^ X^ 

15 X55 P Xg 6 T F X$ 7 G L Xg 8 X$ 9 L X 70 JCji L X^ L X^ X 74 C X 75 L X 76 X7 7 L X 78 

X ra Xg 0 X 8 i F Xjj GL Xgj X M LQYLYLQX, 5 NXj 6 X w Xg 8 X 89 L Xjq D 
X^i X^ F X^ D L X^ N L X^ H L F L H G N X^ X^ X^ Xp 9 X 100 X 101 X 102 

^io4 F R G L X 105 X 106 L D R L L L H X 107 N X 108 X 109 X 110 X U1 V H X U2 
X 113 A F X 114 Xi 15 L X 116 R L X 117 X 118 L X 119 L F X 120 N X 121 L X 122 X 123 L 

20 X 124 X 125 X 126 X 127 L X 128 X 129 L X 130 X 131 L X 132 X 133 L R L N X 134 N X 135 W 

^136 C X 137 C R X 138 R X 139 L W X 140 W X 141 X 142 X 143 X 144 R X 145 S S S X 146 
V X 147 C X 148 X 149 P X 150 X 151 X I52 X 153 X 154 X 15S D L X I36 X 157 L X 15g X 159 X 160 
D X M1 X 162 X 163 C [SEQ ID N0.18] 

25 wherein X is any amino acid residue or a gap and wherein the polypeptide is 

not the polypeptide of SEQ ID NO: 5 (human NgRl) or SEQ ID NO: 17 
(mouse NgRl). 

16. The polypeptide according to claim 15, wherein X 37 and X 38 
30 represents a gap. 

17. A polypeptide comprising an amino sequence selected from the 
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group consisting of: SEQ ID NO:2, SEQ ID NO:4 and SEQ ID NO: 14. 

18. The polypeptide according any one of claims 1 1, 15 or 17, wherein 
the polypeptide comprises: (a) an NTLRRCT domain, and (b) less than a complete 

5 CTS domain, provided that a partial CTS domain, if present, consists of no more than 
the first 39 amino acids of the CTS domain. 

19. The polypeptide according to any one of claims 11, 15 or 17, 
wherein the polypeptide does not comprise an intact GPI domain. 

10 

20. The polypeptide according to any one of claims 1 1, 15 or 17, 
wherein the amino acid sequence of the polypeptide further comprises an amino acid 
sequence of a heterologous polypeptide. 

15 21. The polypeptide according to claim 20, wherein the heterologous 

polypeptide is an Fc portion of an antibody. 

22. A method of producing a polypeptide according to any one of 
claims 11, 15 or 17, comprising the steps of introducing an isolated nucleic acid 

20 according to any one of claims 1, 4, 5 or 8 or a vector according to claim 9 into a host 
cell, culturing said host cell under conditions suitable for expression of said 
polypeptide, and recovering said polypeptide. 

23. An antibody that binds to a polypeptide of any one of claims 11,15 

25 or 17. 

24. A composition comprising the polypeptide of claim 1 1, 15 or 17 
and a pharmaceutically acceptable carrier. 



30 



25. A composition comprising the antibody of claim 23 and a 
pharmaceutically acceptable carrier. 
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26. A method of decreasing inhibition of axonal growth of a CNS 
neuron, comprising the step of contacting the neuron with an effective amount of the 
polypeptide of claim 1 1, 15 or 17. 

5 27. A method of treating a central nervous system disease, disorder or 

injury, comprising administering to a mammal an effective amount of the polypeptide 
of claim 11, 15 or 17. 

28. A method of decreasing inhibition of axonal growth of a CNS 
10 neuron comprising the step of contacting the neuron with an effective amount of the 

antibody according to claim 23 . 

29. A method of treating a central nervous system disease, disorder or 
injury, comprising administering to a mammal an effective amount of the antibody 

1 5 according to claim 23 . 

30. A method for identifying a molecule that binds a polypeptide of 
claim 1 1, 15 or 17 comprising the steps of: 

(a) providing a polypeptide of claim 1 1, 15 or 17; 
20 (b) contacting the polypeptide with the candidate molecule; 

and 

(c) detecting binding of the candidate molecule to the 
polypeptide. 
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SEQUENCE LISTING 



<110> BIOGEN, INC. 

YALE UNIVERSITY 
STRITTMATTER, STEPHEN M. 
GATE, RICHARD L. 
SAH, DINAH W.Y. 

<120> NOGO RECEPTOR HOMOLOGS 

<130> A116PCT 

<140> 
<141> 

<150> 60/238 f 361 
<151> 2000-10-06 

<160> 16 

<170> Patentln Ver. 2.1 

<210> 17 
<211> 1260 
<212> DNA 

<213> Homo sapiens 
<400> 1 

atgctgcccg ggctcaggcg cctgctgcaa gctcccgcct cggcctgcct cctgctgatg 60 
ctcctggccc tgcccctggc ggcccccagc tgccccatgc tctgcacctg ctactcatcc 120 
ccgcccaccg tgagctgcca ggccaacaac ttctcctctg tgccgctgtc cctgccaccc 180 
agcactcagc gactcttcct gcagaacaac ctcatccgca cgctgcggcc aggcaccttt 240 
gggtccaacc tgctcaccct gtggctcttc tccaacaacc tctccaccat ctacccgggc 300 
actttccgcc acttgcaagc cctggaggag ctggacctcg gtgacaaccg gcacctgcgc 360 
tcgctggagc ccgacacctt ccagggcctg gagcggctgc agtcgctgca tttgtaccgc 420 
tgccagctca gcagcctgcc cggcaacatc ttccgaggcc tggtcagcct gcagtacctc 480 
tacctccagg agaacagcct gctccaccta caggatgact tgttcgcgga cctggccaac 540 
ctgagccacc tcttcctcca cgggaaccgc ctgcggctgc tcacagagca cgtgtttcgc 600 
ggcctgggca gcctggaccg gctgctgctg cacgggaacc ggctgcaggg cgtgcaccgc 660 
gcggccttcc gcggcctcag ccgcctcacc atcctctacc tgttcaacaa cagcctggcc 720 
tcgctgcccg gcgaggcgct cgccgacctg ccctcgctcg agttcctgcg gctcaacgct 780 
aacccctggg cgtgcgactg ccgcgcgcgg ccgctctggg cctggttcca gcgcgcgcgc 840 
gtgtccagct ccgacgtgac ctgcgccacc cccccggagc gccagggccg agacctgcgc 900 
gcgctccgcg aggccgactt ccaggcgtgt ccgcccgcgg cacccacgcg gccgggcagc 960 
cgcgcccgcg gcaacagctc ctccaaccac ctgtacgggg tggccgaggc cggggcgccc 1020 
ccagccgatc cctccaccct ctaccgagat ctgcctgccg aagactcgcg ggggcgccag 1080 
ggcggggacg cgcctactga ggacgactac tgggggggct acgggggtga ggaccagcga 1140 
ggggagcaga tgtgccccgg cgctgcctgc caggcgcccc cggactcccg aggccctgcg 1200 
ctctcggccg ggctccccag ccctctgctt tgcctcctgc tcctggtgcc ccaccacctc 1260 



<210> 2 
<211> 420 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Leu Pro Gly Leu Arg Arg Leu Leu Gin Ala Pro Ala Ser Ala Cys 
1 5 10 15 

Leu Leu Leu Met Leu Leu Ala Leu Pro Leu Ala Ala Pro Ser Cys Pro 
20 25 30 
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Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys Gin Ala 
35 ~ 40 45 

Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr Gin Arg 
50 55 60 

Leu Phe Leu Gin Asn Asn Leu lie Arg Thr Leu Arg Pro Gly Thr Phe 
65 70 75 80 

Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu Ser Thr 
85 90 95 

lie Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu Leu Asp 
100 105 110 

Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr Phe Gin 
115 120 125 

Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin Leu Ser 
130 135 140 

Ser Leu Pro Gly Asn lie Phe Arg Gly Leu Val Ser Leu Gin Tyr Leu 
145 " 150 155 160 

Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu Phe Ala 
165 170 175 

Asp Leu Ala Asn Leu Ser His Leu Phe Leu His Gly Asn Arg Leu Arg 
180 185 190 

Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp Arg Leu 
195 200 205 

Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala Phe Arg 
210 ~ 215 220 

Gly Leu Ser Arg Leu Thr lie Leu Tyr Leu Phe Asn Asn Ser Leu Ala 
225 " 230 235 240 

Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu Phe Leu 
245 250 255 

Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg Pro Leu 
260 265 " 270 

Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val Thr Cys 
275 280 285 

Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu Arg Glu 
290 295 300 

Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro Gly Ser 
305 310 315 320 

Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val Ala Glu 
325 330 335 



Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp Leu Pro 
340 345 350 

Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr Glu Asp 
355 360 365 

Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu Gin Met 
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370 



375 



380 



Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly Pro Ala 

385 390 395 400 

Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu Leu Val 

405 410 415 



Pro His His Leu 
420 



<210> 3 
<211> 1383 
<212> DNA 
<213> Mus sp. 



<400> 3 

atgtcttggc 

gggtgctgtg 

tgtcctcgag 

tttgctgcca 

cgcatcacct 

tccaacaaca 

ctagaccttg 

gtgaagcttc 

tttggtggcc 

caagatgaca 

ctatggagcc 

catgagaacc 

accctctttc 

gtggccttgg 

tccctgtggg 

cccgagctgc 

acaggaccag 

gcccgcaagg 

catccgcctg 

aaccggaacc 

gcccccgact 

aagggcaagt 

tcaggcacgg 

cgc 



agtctggaac 
tggaattgct 
actgtgtgtg 
tcccggaggg 
tcctccagca 
tcactttcat 
gagacaaccg 
acgccctcta 
tgcacagcct 
tctttgtgga 
tgggccaagg 
agctacagtg 
tcttcaacaa 
agttccttcg 
aatggctgcg 
ggcaaggcca 
tgtctcctca 
agcaccatcc 
gctccaggtc 
agatctctaa 
atcagcacaa 
gtgctcgcag 
cccttggggc 



cacagtgaca 
gctgttgctg 
ctaccctgcg 
catcccagag 
gggccacttc 
tgctcccaac 
acagctgcga 
cctctataag 
gcagtatctc 
cctggtcaat 
catcttccgg 
ggttcaccac 
cagcctcact 
cctcaatggg 
aaggttccgt 
ggatctgaag 
ccagatcaag 
gtcccatggg 
aggttacaag 
ggtgagctct 
gttcagcttt 
gacccccatc 
cccactcctg 



caatctcccg 

ctcgctggag 

cccatgactg 

gacagtgagc 

agccccgcca 

accttcgagg 

acgctggcac 

tgtggactga 

tacttgcagg- 

ctcagtcact: 

ggcctggtga' 

aaggctttcc 

gagctgcagg 

aatgcttggg 

ggctctagct 

ctgctgaggg 

tctcacacgc 

gcctccaggg 

aaggcaggca 

gggaaagagc 

gacatcatgc 

cgtgccccca 

gcctggatac 



tgcaggctgc 
agctacctct 
tcagctgcca 
gcatcttcct 
tggtcaccct 
gctttgtgca 
ccgagacctt 
gcgccctgcc 
acaaccatat 
tgtttctcca 
acctggaccg 
atgacctcca 
gtgactgtct 
actgtggctg 
ctgctgtccc 
tggaggactt 
ttaccacctc 
acaaaggcca 
agaactgcac 
ttaccgaact 
ccaccgcacg 

gtggggtgca 

tggggctggc 



tcaggtctca 
gggtggtggt 
ggcacacaac 
gcagaacaat 
ctggatctac 
tctggaggag 
ccaaggcctg 
cgcaggcatc 
cgagtacctc 
tggtaacaag 
gttgctgctg 
caggctaacc 
ggcccccctg 
ccgggcacgt 
ctgcgcgacc 
ccggaactgc 
tgacagggct 
cccacatggc 
cagccacagg 
gcaggactat 
acccaagagg 
gcaggcatcc 
agtcactctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1383 



<210> 4 
<211> 461 
<212> PRT 
<213> Mus sp. 

<400> 4 

Met Ser Trp Gin Ser Gly Thr Thr Val Thr Gin Ser Pro Val Gin Ala 
1 5 10 15 

Ala Gin Val Ser Gly Cys Cys Val Glu Leu Leu Leu Leu Leu Leu Ala 
20 4 25 30 

Gly Glu Leu Pro Leu Gly Gly Gly Cys Pro Arg Asp Cys Val Cys Tyr 
35 40 45 

Pro Ala Pro Met Thr Val Ser Cys Gin Ala His Asn Phe Ala Ala He 
50 55 60 

Pro Glu Gly He Pro Glu Asp Ser Glu Arg He Phe Leu Gin Asn Asn 
65 70 75 80 
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Arg lie Thr Phe Leu Gin Gin Gly His Phe Ser Pro Ala Met Val Thr 
85 90 95 

Leu Trp He Tyr Ser Asn Asn He Thr Phe He Ala Pro Asn Thr Phe 
100 105 HO 

Glu Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin 
115 120 125 

Leu Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His 
130 135 140 

Ala Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly He 
145 " 150 155 160 

Phe Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His 
165 170 175 

He Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser 
180 185 190 

His Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Gin Gly He 
195 ' 200 205 

Phe Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin 
210 215 220 

Leu Gin Trp Val His His Lys Ala Phe His Asp Leu His Arg Leu Thr 
225 ' 230 235* 240 

Thr Leu Phe Leu Phe Asn Asn Ser Leu Thr Glu Leu Gin Gly Asp Cys 
245 250 255 

Leu Ala Pro Leu Val Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Ala 
260 265 270 

Trp Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Arg Arg 
275 280 285 

Phe Arg Gly Ser Ser Ser Ala Val Pro Cys Ala Thr Pro Glu Leu Arg 
290 295 300 

Gin Gly Gin Asp Leu Lys Leu Leu Arg Val Glu Asp Phe Arg Asn Cys 
305 310 315 320 

Thr Gly Pro Val Ser Pro His Gin He Lys Ser His Thr Leu Thr Thr 
325 330 335 

Ser Asp Arg Ala Ala Arg Lys Glu His His Pro Ser His Gly Ala Ser 
340 345 350 

Arg Asp Lys Gly His Pro His Gly His Pro Pro Gly Ser Arg Ser Gly 
355 360 365 

Tyr Lys Lys Ala Gly Lys Asn Cys Thr Ser His Arg Asn Arg Asn Gin 
370 375 380 

He Ser Lys Val Ser Ser Gly Lys Glu Leu Thr Glu Leu Gin Asp Tyr 
385 ^ 390 395 400 

Ala Pro Asp Tyr Gin His Lys Phe Ser Phe Asp He Met Pro Thr Ala 
405 410 415 

Arg Pro Lys Arg Lys Gly Lys Cys Ala Arg Arg Thr Pro He Arg Ala 



WO 02/29059 



PCT/US01/31488 



420 



425 



430 



Pro Ser Gly Val Gin Gin Ala Ser Ser Gly Thr Ala Leu Gly Ala Pro 
435 440 445 

Leu Leu Ala Trp lie Leu Gly Leu Ala Val Thr Leu Arg 



<210> 5 

<211> 473 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Lys Arg Ala Ser Ala Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Gin Val Ala Ala Pro Cys Pro Gly Ala Cys Val 
20 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Val Gly lie Pro Ala Ala Ser Gin Arg lie Phe Leu 
50 55 60 

His Gly Asn Arg lie Ser His Val Pro Ala Ala Ser Phe Arg Ala Cys 
65 " 70 75 80 

Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Val Leu Ala Arg lie 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Ala Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu Arg Ser Val Asp Pro Ala Thr Phe His Gly 
115 120 125 

Leu Gly Arg Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Gin Glu 
130 " 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 * 150 155 160 

Leu Gin Asp Asn Ala Leu Gin Ala Leu Pro Asp Asp Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg lie Ser Ser 
180 185 190 

Val Pro Glu Arg Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 *" 200 205 

Leu His Gin Asn Arg Val Ala His Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Ala 
225 " 230 235 240 

Leu Pro Thr Glu Ala Leu Ala Pro Leu Arg Ala Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 



450 



455 



460 
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260 265 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Ser 
275 *" 280 285 

Leu Pro Gin Arg Leu Ala Gly Arg Asp Leu Lys Arg Leu Ala Ala Asn 
290 295 300 

Asp Leu Gin Gly Cys Ala Val Ala Thr Gly Pro Tyr His Pro lie Trp 
305 310 315 320 

Thr Gly Arg Ala Thr Asp Glu Glu Pro Leu Gly Leu Pro Lys Cys Cys 
325 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Ser 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His lie Asn Asp Ser Pro Phe 
370 375 380 

Gly Thr Leu Pro Gly Ser Ala Glu Pro Pro Leu Thr Ala Val Arg Pro 
385 390 395 400 

Glu Gly Ser Glu Pro Pro Gly Phe Pro Thr Ser- Gly Pro Arg Arg Arg 
405 ~ 410 ' 415 

Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
420 425 430 

Gin Ala Gly Ser Gly Gly Gly Gly Thr Gly Asp Ser Glu Gly Ser Gly 
435 440 445 

Ala Leu Pro Ser Leu Thr Cys Ser Leu Thr Pro Leu Gly Leu Ala Leu 
450 455 460 

Val Leu Trp Thr Val Leu Gly Pro Cys 
465 470 



<210> 6 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (9) . . (10) 
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<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (15) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (18) • . (25) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (27) . . (29) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (31) . . (34) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (36) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (39) . . (40) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (42) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (44) . . (50) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (52) (59) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (62) . . (63) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (66) . . (69) 

<223> Variable amino acid 

<220> 

<221> MOD RES 
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<222> (71),. (74) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (76) . . (80) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (83) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (87) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (90).. (91) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (94) . . (96) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (98) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (101) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (104) . . (105) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (107) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 

<222> (111).. (112) 

<223> Variable amino acid 



<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 



<220> 
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<221> MOD_RES 

<222> (116).. (117) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (119) . . (122) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (124) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (127) . . (128) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (136) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (138) . . (141) 

<223> Variable amino acid 

<220> 

<221> MODJRES 
<222> (143) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (146) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (148) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (151) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (154) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (162) . . (170) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (175) . . (176) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (184) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (186) . . (189) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (196) . . (197) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (199) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (202) . . (203) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (205) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (208) 

<223> Variable amino acid 
<220> 

<221> MOD_RE5 
<222> (210) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (212) . . (213) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (215) . . (218) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (221) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (223) . . (224) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (226) . . (227) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (232) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (234) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (236) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (238) 

<223> Variable amino acid 
<220> 

<221> MOD__RES 
<222> (243) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (246) 

<223> Variable amino acid 
<220> 

<221> MODJRES 

<222> (248) . . (251) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (253) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (257) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (259) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (261) . . (262) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (264) . . (267) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (269) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (272) . . (273) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (275) . . (277) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (279) . . (281) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (283) . . (287) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (289).. (290) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (292) . . (328) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (330) . . (341) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (344) . . (346) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (348) . . (399) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (401) . . (428) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (431) . . (439) 

<223> Variable amino acid 

<400> 6 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
15 10 15 
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Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Xaa Pro Xaa Xaa 
20 25 30 

Xaa Xaa Arg Xaa Phe Leu Xaa Xaa Asn Xaa lie Xaa Xaa Xaa Xaa Xaa 
35 40 45 

Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser 
50 55 60 

Asn Xaa Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa 
65 70 75 80 

Leu Glu Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa 
85 90 95 

Pro Xaa Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa Leu Xaa Xaa 
100 ~ 105 110 

Cys Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa 
115 120 125 

Leu Gin Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp 
130 " 135 140 

Asp Xaa Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly 
145 * 150 155 160 

Asn Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa 
165 170 175 

Leu Asp Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa 
180 185 190 

Xaa Ala Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa 
195 200 205 

Asn Xaa Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa 
210 215 220 

Leu Xaa Xaa Leu Arg Leu Asn Xaa Asn Xaa Trp Xaa Cys Xaa Cys Arg 
225 230 235 240 

Ala Arg Xaa Leu Trp Xaa Trp Xaa Xaa Xaa Xaa Arg Xaa Ser Ser Ser 
245 250 255 

Xaa Val Xaa Cys Xaa Xaa Pro Xaa Xaa Xaa Xaa Gly Xaa Asp Leu Xaa 
260 265 270 

Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Xaa Xaa Xaa Xaa Xaa Pro 
275 280 285 

Xaa Xaa Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 335 

Xaa Xaa Xaa Xaa Xaa Pro Pro Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa 
340 345 350 * 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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355 360 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
385 390 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
405 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
420 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu 
435 440 



365 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
380 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg 
395 400 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
410 415 

Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 
425 430 



<210> 7 

<211> 66 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Arg He Tyr Lys Gly Val He Gin Ala He Gin Lys Ser Asp Glu Gly 
15 10 15 

His Pro Phe Arg Ala Tyr Leu Glu Ser Glu Val Ala He Ser Glu Glu 
20 25 30 

Leu Val Gin Lys Tyr Ser Asn Ser Ala Leu Gly His Val Asn Cys Thr 
35 40 45 

He Lys Glu Leu Arg Arg Leu Phe Leu Val Asp Asp Leu Val Asp Ser 
50 55 60 

Leu Lys 
65 



<210> 8 

<211> 390 

<212> PRT 

<213> Homo sapeins 

<400> 8 

Cys Pro Met Leu Cys Thr Cys Tyr Ser Ser Pro Pro Thr Val Ser Cys 
1 5 10 15 

Gin Ala Asn Asn Phe Ser Ser Val Pro Leu Ser Leu Pro Pro Ser Thr 
20 25 30 

Gin Arg Leu Phe Leu Gin Asn Asn Leu He Arg Thr Leu Arg Pro Gly 
35 40 45 

Thr Phe Gly Ser Asn Leu Leu Thr Leu Trp Leu Phe Ser Asn Asn Leu 
50 55 60 

Ser Thr He Tyr Pro Gly Thr Phe Arg His Leu Gin Ala Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg His Leu Arg Ser Leu Glu Pro Asp Thr 
85 90 95 
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Phe Gin Gly Leu Glu Arg Leu Gin Ser Leu His Leu Tyr Arg Cys Gin 
100 105 110 

Leu Ser Ser Leu Pro Gly Asn He Phe Arg Gly Leu Val Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Glu Asn Ser Leu Leu His Leu Gin Asp Asp Leu 
130 135 140 

Phe Ala Asp Leu Ala Asn Leu Ser His Leu Phe Leu His dly Asn Arg 
145 150 155 160 

Leu Arg Leu Leu Thr Glu His Val Phe Arg Gly Leu Gly Ser Leu Asp 
165 170 175 

Arg Leu Leu Leu His Gly Asn Arg Leu Gin Gly Val His Arg Ala Ala 
180 185 190 

Phe Arg Gly Leu Ser Arg Leu Thr He Leu Tyr Leu Phe Asn Asn Ser 
195 200 205 

Leu Ala Ser Leu Pro Gly Glu Ala Leu Ala Asp Leu Pro Ser Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Ala Asn Pro Trp Ala Cys Asp Cys Arg Ala Arg 
225 230 235 240 

Pro Leu Trp Ala Trp Phe Gin Arg Ala Arg Val Ser Ser Ser Asp Val 
245 250 255 

Thr Cys Ala Thr Pro Pro Glu Arg Gin Gly Arg Asp Leu Arg Ala Leu 
260 265 270 

Arg Glu Ala Asp Phe Gin Ala Cys Pro Pro Ala Ala Pro Thr Arg Pro 
275 280 285 

Gly Ser Arg Ala Arg Gly Asn Ser Ser Ser Asn His Leu Tyr Gly Val 
290 295 300 

Ala Glu Ala Gly Ala Pro Pro Ala Asp Pro Ser Thr Leu Tyr Arg Asp 
305 ~ 310 315 320 

Leu Pro Ala Glu Asp Ser Arg Gly Arg Gin Gly Gly Asp Ala Pro Thr 
325 330 335 

Glu Asp Asp Tyr Trp Gly Gly Tyr Gly Gly Glu Asp Gin Arg Gly Glu 
340 345 350 

Gin Met Cys Pro Gly Ala Ala Cys Gin Ala Pro Pro Asp Ser Arg Gly 
355 * 360 365 

Pro Ala Leu Ser Ala Gly Leu Pro Ser Pro Leu Leu Cys Leu Leu Leu 
370 375 380 



Leu Val Pro His His Leu 
385 390 



<210> 9 
<211> 421 
<212> PRT 
<213> Mus sp. 

<400> 9 
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Cys Pro Arg Asp Cys Val Cys Tyr Pro Ala Pro Met Thr Val Ser Cys 
1 5 10 15 

Gin Ala His Asn Phe Ala Ala He Pro Glu Gly He Pro Glu Asp Ser 
20 25 30 

Glu Arg He Phe Leu Gin Asn Asn Arg He Thr Phe Leu Gin Gin Gly 
35 40 45 

His Phe Ser Pro Ala Met Val Thr Leu Trp He Tyr Ser Asn Asn He 
50 55 60 

Thr Phe lie Ala Pro Asn Thr Phe Glu Gly Phe Val His Leu Glu Glu 
65 70 75 80 

Leu Asp Leu Gly Asp Asn Arg Gin Leu Arg Thr Leu Ala Pro Glu Thr 
85 90 95 

Phe Gin Gly Leu Val Lys Leu His Ala Leu Tyr Leu Tyr Lys Cys Gly 
100 105 110 

Leu Ser Ala Leu Pro Ala Gly He Phe Gly Gly Leu His Ser Leu Gin 
115 120 125 

Tyr Leu Tyr Leu Gin Asp Asn His He Glu Tyr Leu Gin Asp Asp He 
130 135 140 

Phe Val Asp Leu Val Asn Leu Ser His Leu Phe Leu His Gly Asn Lys 
145 150 155' 160 

Leu Trp Ser Leu Gly Gin Gly He Phe Arg Gly Leu Val Asn Leu Asp 
165 170 175 

Arg Leu Leu Leu His Glu Asn Gin Leu Gin Trp Val His His Lys Ala 
180 185 190 

Phe His Asp Leu His Arg Leu Thr Thr Leu Phe Leu Phe Asn Asn Ser 
195 200 205 

Leu Thr Glu Leu Gin Gly Asp Cys Leu Ala Pro Leu Val Ala Leu Glu 
210 215 220 

Phe Leu Arg Leu Asn Gly Asn Ala Trp Asp Cys Gly Cys Arg Ala Arg 
225 " 230 235 240 

Ser Leu Trp Glu Trp Leu Arg Arg Phe Arg Gly Ser Ser Ser Ala Val 
245 250 255 

Pro Cys Ala Thr Pro Glu Leu Arg Gin Gly Gin Asp Leu Lys Leu Leu 
260 265 270 

Arg Val Glu Asp Phe Arg Asn Cys Thr Gly Pro Val Ser Pro His Gin 
275 280 285 



He Lys Ser His Thr Leu Thr Thr Ser Asp Arg Ala Ala Arg Lys Glu 
290 295 300 

His His Pro Ser His Gly Ala Ser Arg Asp Lys Gly His Pro His Gly 
305 310 315 320 

His Pro Pro Gly Ser Arg Ser Gly Tyr Lys Lys Ala Gly Lys Asn Cys 
325 330 335 

Thr Ser His Arg Asn Arg Asn Gin He Ser Lys Val Ser Ser Gly Lys 
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340 345 350 

Glu Leu Thr Glu Leu Gin Asp Tyr Ala Pro Asp Tyr Gin His Lys Phe 
355 360 365 

Ser Phe Asp He Met Pro Thr Ala Arg Pro Lys Arg Lys Gly Lys Cys 
370 375 380 

Ala Arg Arg Thr Pro He Arg Ala Pro Ser Gly Val Gin Gin Ala Ser 
385 ** 390 395 ' 400 

Ser Gly Thr Ala Leu Gly Ala Pro Leu Leu Ala Trp He Leu Gly Leu 
405 410 415 

Ala Val Thr Leu Arg 
420 



<210> 10 
<211> 17 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> M0D_RES 

<222> (3) . . (4) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (9) . . (10) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (12) . . (13) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (15) 

<223> Variable amino acid 
<400> 10 

Cys Pro Xaa Xaa Cys Xaa Cys Tyr Xaa Xaa Pro Xaa Xaa Thr Xaa Ser 
1 5 * 10 15 

Cys 



<210> 11 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (4) 

<223> Variable amino acid 
<220> 

<221> MODJRES 
<222> (6) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (11) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (14) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (16) . . (19) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (21) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (25) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (27) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (29) . . (30) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (32) . . (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (37) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (40) (41) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (43) . . (45) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (47) . . (49) 

<223> Variable amino acid 

<400> 11 

Asn Xaa Trp Xaa Cys Xaa Cys Arg Ala Arg Xaa Leu Trp Xaa Trp Xaa 
15 10 15 

Xaa Xaa Xaa Arg Xaa Ser Ser Ser Xaa Val Xaa Cys Xaa Xaa Pro Xaa 
20 25 30 

Xaa Xaa Xaa Gly Xaa Asp Leu Xaa Xaa Leu Xaa Xaa Xaa Asp Xaa Xaa 
35 40 45 

Xaa Cys 
50 



<210> 12 
<211> 196 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence 

<220> 

<221> MOD_RES 
<222> (2) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (5) . . (6) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (8) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (10) . . (16) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (IB) . . (25) 

<223> Variable amino acid 

<220> 
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<221> M0D_RES 

<222> (28) . . (29) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 

<222> (32).. (35) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (37) . . (40) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 

<222> (42) , . (46) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 
<222> (49) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 
<222> (53) 

<223> Variable amino acid 
<220> 

<221> M0D_RES 

<222> (56) . . (57) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 

<222> (60) . . (62) 

<223> Variable amino acid 

<220> 

<221> M0D_RES 

<222> (64) 

<223> Variable amino acid 
<220> 

<221> MODJRES 

<222> (67) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (70) . . (71) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (73) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (75) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (77) . . (78) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (80) 

<223> Variable amino acid 
<220> 

<221> MOD RES 

<222> (82"f. . (83) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (85) . . (88) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (90) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (93) . . (94) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (102) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (104) . . (107) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (109) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (112) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (114) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (117) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (120) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 

<222> (128) . . (136) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (141) . . (142) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (150) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (152) . . (155) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (158) . . (159) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (162) . . (163) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (165) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (168) . . (169) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 
<222> (171) 

<223> Variable amino acid 
<220> 

<221> MODJRES 
<222> (174) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 
<222> (176) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (178) . . (179) 

<223> Variable amino acid 

<220> 

<221> MODJRES 

<222> (181).. (184) 

<223> Variable amino acid 
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<220> 

<221> MOD_RES 
<222> (187) 

<223> Variable amino acid 
<220> 

<221> MOD_RES 

<222> (189) . . (190) 

<223> Variable amino acid 

<220> 

<221> MOD_RES 

<222> (192) . . (193) 

<223> Variable amino acid 

<400> 12 

Arg Xaa Phe Leu Xaa Xaa Asn Xaa He Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Trp Xaa Xaa Ser Asn Xaa 
20 25 30 

Xaa Xaa Xaa He Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Xaa Xaa Leu Glu 
35 40 45 

Xaa Leu Asp Leu Xaa Asp Asn Xaa Xaa Leu Arg Xaa Xaa Xaa Pro Xaa 
50 55 60 

Thr Phe Xaa Gly Leu Xaa Xaa Leu Xaa Leu Xaa' Leu Xaa Xaa Cys Xaa 
65 ' 70 75 . 80 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Phe Xaa Gly Leu Xaa Xaa Leu Gin 
85 90 95 

Tyr Leu Tyr Leu Gin Xaa Asn Xaa Xaa Xaa Xaa Leu Xaa Asp Asp Xaa 
100 105 HO 

Phe Xaa Asp Leu Xaa Asn Leu Xaa His Leu Phe Leu His Gly Asn Xaa 
115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Arg Gly Leu Xaa Xaa Leu Asp 
130 135 140 

Arg Leu Leu Leu His Xaa Asn Xaa Xaa Xaa Xaa Val His Xaa Xaa Ala 
145 150 155 160 

Phe Xaa Xaa Leu Xaa Arg Leu Xaa Xaa Leu Xaa Leu Phe Xaa Asn Xaa 
165 170 175 

Leu Xaa Xaa Leu Xaa Xaa Xaa Xaa Leu Ala Xaa Leu Xaa Xaa Leu Xaa 
180 185 190 

Xaa Leu Arg Leu 
195 



<210> 13 

<211> 1176 

<212> DNA 

<213> Homo sapiens 

<400> 13 

gagggcatcc ccgtggacag cgagcgcgtc 
cagcccggcc acttcagccc cgccatggtc 



ttcctgcaga acaaccgcat cggcctcctc 60 
accctgtgga tctactcgaa caacatcacc 120 
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tacatccacc ccagcacctt cgagggcttc 
aaccggcagc tgcggacgct ggcacccgag 
ctctacctct acaagtgtgg gctcagcgcc 
agcctgcagt acctctacct gcaggacaac 
gtggacctgg tcaacctcag ccacctgttt 
ccgggcacct tccggggcct ggtgaacctg 
cagtgggtcc accacaaggc attccacgac 
aacaacagcc tctcggagct gcagggtgag 
ctccgcctca acggcaaccc ctgggactgt 
ctgcagaggt tccggggctc cagctccgct 
ggccaggacc tgaagctgct gagggccgag 
ccgcaccaga tcaagtcaca cacgctcacc 
cactcacccc acggccccac caggagcaag 
aggaagccgg ggaagaactg caccaacccc 
gccgggaaac aggcccccga gctgccagac 
tttgacatca tgcctacggc ccggcccaag 
atccgtgccc ccagcggggt gcagcaggcc 
ctggcctgga cactggggct ggcggtcact 



gtgcacctgg aggagctgga cctcggcgac 180 

accttccagg gcctggtgaa gcttcacgcc 240 

ttgccggccg gcgtctttgg cggcctgcac 300 

cacatcgagt acctccagga cgacatcttc 360 

ctccacggca acaagctgtg gagtctgggc 420 

gaccgtcttt tgctgcacga gaaccagctg 480 

ctccgcaggc tgaccaccct cttcctcttc 540 

tgcctggccc cgctgggggc cctggagttc 600 

ggttgtcgcg cgcgctccct gtgggaatgg 660 

gtcccctgtg tgtcccctgg gctgcggcac 720 

gacttccgga actgcacggg accagcgtcc 780 

accaccgaca gggccgcccg caaggaacac 840 

ggccacccgc acggcccccg gcccggccac 900 

aggaaccgca atcagatctc taaggcgggc 960 

tatgccccag actaccagca caagttcagt 1020 

aggaagggca agtgtgcccg caggaccccc 1080 

tcctcggcca gttccctggg ggcctccctc 1140 
ctccgc 1176 



<210> 14 

<211> 392 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Glu Gly He Pro Val Asp Ser Glu Arg Val Phe Leu Gin Asn Asn Arg 
1 5 10 15 

He Gly Leu Leu Gin Pro Gly His Phe Ser Pro Ala Met Val Thr Leu 
20 25 30 

Trp He Tyr Ser Asn Asn He Thr Tyr lie His Pro Ser Thr Phe Glu 
35 40 45 

Gly Phe Val His Leu Glu Glu Leu Asp Leu Gly Asp Asn Arg Gin Leu 
50 55 60 

Arg Thr Leu Ala Pro Glu Thr Phe Gin Gly Leu Val Lys Leu His Ala 
65 70 75 80 

Leu Tyr Leu Tyr Lys Cys Gly Leu Ser Ala Leu Pro Ala Gly Val Phe 
85 90 _ 95 

Gly Gly Leu His Ser Leu Gin Tyr Leu Tyr Leu Gin Asp Asn His He 
100 105 110 

Glu Tyr Leu Gin Asp Asp He Phe Val Asp Leu Val Asn Leu Ser His 
115 * * 120 125 

Leu Phe Leu His Gly Asn Lys Leu Trp Ser Leu Gly Pro Gly Thr Phe 
130 135 140 

Arg Gly Leu Val Asn Leu Asp Arg Leu Leu Leu His Glu Asn Gin Leu 
145 150 155 160 

Gin Trp Val His His Lys Ala Phe His Asp Leu Arg Arg Leu Thr Thr 
165 170 175 

Leu Phe Leu Phe Asn Asn Ser Leu Ser Glu Leu Gin Gly Glu Cys Leu 
180 185 190 

Ala Pro Leu Gly Ala Leu Glu Phe Leu Arg Leu Asn Gly Asn Pro Trp 
195 200 205 
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Asp Cys Gly Cys Arg Ala Arg Ser Leu Trp Glu Trp Leu Gin Arg Phe 
210 215 220 

Arg Gly Ser Ser Ser Ala Val Pro Cys Val Ser Pro Gly Leu Arg His 
225 230 235 240 

Gly Gin Asp Leu Lys Leu Leu Arg Ala Glu Asp Phe Arg Asn Cys Thr 
245 250 255 

Gly Pro Ala Ser Pro His Gin lie Lys Ser His Thr Leu Thr Thr Thr 
260 265 270 

Asp Arg Ala Ala Arg Lys Glu His His Ser Pro His Gly Pro Thr Arg 
275 280 285 

Ser Lys Gly His Pro His Gly Pro Arg Pro Gly His Arg Lys Pro Gly 
290 295 300 

Lys Asn Cys Thr Asn Pro Arg Asn Arg Asn Gin lie Ser Lys Ala Gly 
305 " 310 315 320 

Ala Gly Lys Gin Ala Pro Glu Leu Pro Asp Tyr Ala Pro Asp Tyr Gin 
325 330 335 

His Lys Phe Ser Phe Asp lie Met Pro Thr Ala Axg Pro Lys Arg Lys 
340 345 350 

Gly Lys Cys Ala Arg Arg Thr Pro lie Arg Ala Pro Ser Gly Val Gin 
355 360 365 

Gin Ala Ser Ser Ala Ser Ser Leu Gly Ala Ser Leu Leu Ala Trp Thr 
370 375 380 

Leu Gly Leu Ala Val Thr Leu Arg 
385 390 



<210> 15 

<211> 143899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> modified_base 

<222> (2044) . . (2144) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (6609) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (6625) . . (6724) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modifiedjoase 

<222> (14153) . . (14252) 

<223> a, t, c, g, other or unknown- 

<220> 

<221> modified base 



25 



WO 02/29059 



PCT/US01/31488 



<222> (19512) . . (19611) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (22595) . . (22694) 

<223> a, t, c, g, other or unknown 



<220> 

<221> modified_base 

<222> (27825) . . (27924) 

<223> a, t, c, q, other or unknown 



<220> 

<221> modified_base 

<222> (34953) . . (35052) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (40783) . . (40882) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (49000) . . (49099) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62884) . . (62983) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (75528) . . (75627) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (87944) . . (88043) 

<223> a, t; c, g, other or unknown 

<220> 

<221> modified_base 

<222> (111030) . . (111129) 

<223> a, t, c, g, other or unknown 



<400> 15 

aagcacatac aggtgacatt acagaactga cagttatgcc aggcactgta cttagcccct 60 

ataccatcct caaacagctg tatgatgtag attgggtatt aaccccatta ataacaaaag 120 

tacagggaac aaagtgactt tccaaaggtc atgccattca aaggagggtg aatcttaggt 180 

tggacgcagg ctgtctgact ctggagtctg aggtgttaat gctgcctcct ccatgggaac 240 

agcccaagtg aaaaacagct gatccactct tcatttactt ggcatctgtg ctaagctggt 300 

ccctgagcca agctctgagc aacagaaaca gaagctctgc attaggagct tgtgagcatg 360 

tcaatgccgg gtaaaggagt gctggaaacc gctgggatgg ccgccgagca ctaggccgtt 420 

gaaggtgggc tctgtgtgac tggttcctct acactctggc ctggctgcct gcaggaagaa 480 

gatcaagctg agtgggctgg ccctggacca caaggtgaca ggtgacctct tctacaccca 540 

tgtgaccacc atgggccaga ggctcagcca gaaggccccc agcctggagg acggttcgga 600 

tgccttcatg tcaccccagg atgttcgggg cacctcagaa aaccttcctg agagtgagtg 660 

tctggtcaag gtgccggcct tgggggatag tgatggtggg tcctcatatt cagtgagcac 720 

tcatggttga gtatttattc gcacccctct tcagtcctta caacacccca tgatgtaggt 780 

ggggcatgct cctcatttac agatgggcac atcaaagctc agctaacgct gggaagttca 840 

gattcagggt taccctgctg gattcctggg attggggagg gaggagcttc caaaatgggg 900 
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acaaggtctc tgggcctgtc gggtagctgg tttcctcagg gccccttgca acctctgagc 960 

ttattgcatc aggtgcagcc aggcccgtga gcctcctggc aggggtcctc cacacctggc 1020 

tgtcttttgc cccctgctgg tcacaggagg agctgcagca cctgcctggg ctgcttctca 1080 

ggagggtaca tgaagatccc aggaccgcca gctccatgat aagtggaagg agctccttgg 1140 

agtcaggagc gggagttgag gagtttgagt cctgctctcc agttataggc tatgtgactt 1200 

gtgtagatca cctaaccttg ctcttgattt ccttacctct taaactagca ctaaaagcac 1260 

cccacaaact gtaaagttag ttgtgatgat tgaatgacac catgggtgtg gaagctcttt 1320 

gtaaagtgca aaacggtgtg cagtttgagg gtggttaccc ccagtgccga ttctcagagg 1380 

gcaacatggc taagggcacg agctggagtt aggctgacct gctgcttcca gccctgtgag 1440 

cttgagcaag tcatttaact tcctgagctg cagtttcctc atcagtaaaa tgtgataagg 1500 

atagggttgt tgtaagattt tattaaatgg ggtaataaat gtcaagtatg tagcccatag 1560 

tgagtgcttc agagtttttt tcttttgttt ctttcccccc cgccccgaga tggagcctta 1620 

ctctgttgcc caggctggag tgcagtggca tgatcttggc tcactgcaac ctccgcctcc 1680 

cgggttcaag caattctcct gcctcagcct cccaaatagc tgggactaca ggcgtgcacc 1740 

accatgctcg gctaattttt gtatctttag tagagacggg gtttcaccat gttggccagg 1800 

ctggtctcga actcctgacc tcatgatgct cctgcctcag cccccgaaag ttttgggatt 1860 

acaagtgtga gcccccgtgc cctgccaggt tttttttttt tttttttttt tgtaaaacac 1920 

ccacagggta ttgctgttgc ctgggctgga gtgcggtagt gcaatcatag ttcactgcag 1980 

ccttgacctc ctgggctcaa gtgatcctcc tgcctcagcc tcctgagtag ctgggaatac 2040 

aggnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2100 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnttttgta ttttaagtag 2160 

agacagggtt ttcccaatgt tggccaaggc tggtctaaaa ctcccaacct caggtgatcc 2220 

acccacctca gcctcccaaa gtactgggat tacaggcgtg agacaccgtg cccagccagg 2280 

aggcttattt tcttgataaa ttacccagtc tcaggtattt ctctacagcg atgcaagaac 2340 

agcctaatac atccaggctc agcatcagtg gacccaggtg ggagagctta agatgtcaag 2400 

gtctgaatgc cgcttccaca cacctttggg acctagggac tccctctctt tttctttttt 2460 

cagtagaaga tgttatcttc tcctttctct gaccagtagt- tggtgatggt ttcagagata 2520 

gtttttcagt caagatatat ttcagtggct tcactgagcc caagttccct cgcctctcta 2580 

ggactttatt tccttgtttc tagaagaggg ataacacata ttttctaagg tggttgtgag 2640 

attaagggag ctggtaccgg gtggtgcata aggacaggat agagcaatgg tgagaccact 2700 

caaaaagcga aaagttgacc tgcgagggtg acacttatca aatcagcaca cagtgggagt 2760 

ggaaggaatg tccctcatca gttacaatat ttggagagtg caagttatag aaaacccagc 2820 

cctggccggg cgcggtgggt catgcctata atcccagcac tttgggaggc tgaggcaggt 2880 

ggatcacgag gtcaggagtt caagaccagc ctgaccaacg tggtgaaacc ccacctctac 2940 

taaaaataca aaattagctg ggcgtggtgg tgtgtgcctg taatctgagc tactcaggag 3000 

gctgaggcac gagaatcact tgaaaccggg aggtggagtt tgcagtgagc cgagatcgca 3060 

ccactgcact ccagcctggg caacagagcg agactccatc tcaaacgaaa aaaaaaaaag 3120 

aaagaaaacc cagctctaac tggcttaaac agtaagaaga tctattatat tatccatctc 3180 

aggcagcagc aagcccagag gtaggggact ccaaggttgg ttgatccagg gcttaacgat 3240 

gtcatcaaag acccaggttc tttctgtctc ggcacctctg tctgcagggc cagcttcatc 3300 

ctaagccaga ttgttcttgt cttgattaca agttggctgc tgggccagca gacgctgcct 3360 

gcctccctgt tcatcttcag aagtagaaag tggcccttcc ccagtcatgg aatgaaagag 3420 

tttcctttct gtctgggatt gcttaggtcc acccacctga agccaatgac tgtcaccagg 3480 

aaggtaatat acactgattg tcttaagtca gggttcctga gccagtcttg ggcaaggagt 3540 

gtgatactgt catgattgtc ttgggctcat cagggcagct ctgcagatga gatcaaactc 3600 

caagctacat tattctgaac agtgggaagt aggaaagaga cattttggga gatacaaaac 3660 

acaatgtcta tcccatatcc ctaggtccag gtcacagtgt cttggttgga catcaaatgt 3720 

agaaaaagaa agactgtcca tccatttatc tacctattca tctggttttt gatttttttt 3780 

aaattttatt ttaagacatt ctcactctgt cacccagact ggagtgcagt ggtttgatca 3840 

tggctcatgg cagcctcaac ctcccaggct caagtgaccc tcccatgctc aagtgatcct 3900 

cctacctcag cctcccaagt agctagaact aaaggtgcat gccaccacgc tcagttaatt 3960 

tttgcatttt ttgtagagat ggggtttcgt catgatgccc atgctagtct ggaattcctg 4020 

aactcaagca atatgcctgc ctttgcctcc caaaatgctg ggattgtagg catgagccac 4080 

tgctcctggc tcatctgttt aataatttat gaaacaacta ctgggtgctg agcacggggc 4140 

caggggctgg agatctagca gggaccaggc agatctctgc caagtcgttg gtttcttaaa 4200 

ggttttgctc ataattcccc ttttcttttc tctttcgttt tttttctttt ctttctttct 4260 

ttctttcttt tttttttttt gagacagagt ctcactctgt tacccaggct ggagtgcagt 4320 

ggtgcgatct cagctcactg caacctctgc ctcctgggtt caagcgattc tcctgcctca 4380 

gcctcccgag tagctgggac tacaggcgcc tgccaccatg cccggctaat ttttgtgttt 4440 

ttagtagaga ctgggtttca ccatattggc caggctggtc ttgaactcct gaccttgtga 4500 

tccgcccgct tcggcctccc acagtgctgg gattacaggc gtgagccacg gcgcccagcc 4560 

agtttccctt ttcaatgagg cctccctgac ctccatactc tactcctcca cctggcccac 4620 

tcagctctac tttttcttcc ccatagcact caagacctcc taacatacta cgtaagttat 4680 

ttatttacta ggcttactgt gtattgtctg tcttcctcta ctagaatgta aactccatga 4740 
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gaatagaaat ttttgccttt ttatttagtg tggtgtctgc agcccctggc ttagtccctg 4800 

gcatacaaca gtcactccac ccacagttgc tgaataagtg actaaaggtc cctgccctca 4860 

tattgttatg agggagtgtg catgttgtta gagaaaaatc tgaggcacaa taaaatttta 4920 

tagagtttaa gttttctttt ttaagcaatc cacgaattgg ggtagtttca. gaggtagttt 4980 

ttcagtcatg acgtatttca atggcttcac tgagcccaag ttctttcacc tctctaggac 5040 

tttatttcct tatttctaga acggggataa cacatagttc ataaggcagt tatgagagta 5100 

agggagctgg tatggggtga tgcataagga caggatagag cagtggtgag accgctcaga 5160 

tgacaaagcg tcagagacca gtatttacga cggaaatgtg gaagcatgat aaagaaatta 5220 

tttgggctgg gcacaatgac tcacaactaa taaaactttg ggaggccaag gtgggaggat 5280 

cacttgactt gcagaaggtc aaggctgcag tgagctgtga ttttgccact gcactccagc 5340 

ctggtcaaca gagtgagacc ctggctcgaa acgttatttg attggttaca gttatacagt 5400 

tgccttattt ggtctattcc atttgaaagt tcctagttct ataattttaa gtttgttggc 5460 

tgtttctgat tggttaagct taagttttgt tttcctttaa tacagttaag tgccccataa 5520 

tgacattttg gtcaaggaca gaccacatat acagtggtgg tcccataaga ttataatgga 5580 

gctgaaacat tcctattgtc tatggcgtag tggtcctgat gttgtagcgc aatgcattag 5640 

ttatatgttt gtggcaatgc tggtgtaaac acacctactg cactgccagt gatataaaag 5700 

aatagcacat acagttatat atagtacata atatctgata atgataatac ataactatat 5760 

tactggttta tatatttact atattattta tctttatttt atttttgaga cagagtctca 5820 

ttctgtcacc caggctggag tgcagtggcg cgatcttggc tcaccgcaac ctccgcttcc 5880 

tgggttcaag tgattctcct gcctcagtct cctgagtagc tgggattaca ggtgtgcacc 5940 

atgacaccct gctaatatgt tttgtatttt tagtagagat ggggtttcac catgttggcc 6000 

aggctggtct tgaactactg acctcaagtg atcaccccgc ctcggcttcc caaagtgctg 6060 

ggattacagg cgtgagccac cacgcatggc ctatttataa ttattttaga gtgtacgcct 6120 

tatacttata aaaaaaagct aactgtcaaa cagcctcggg caggtccttc aacagatatt 6180 

ccagaagaca ttgttatcat aggagatgac agctccgtgc atattattgt ccctgaaaac 6240 

cttctagtgt ggaagtggaa gacagtgata ttgatgatag gacccagtgt aggcctaggc 6300 

taatgtgtgt gtttgtgtct ttgcttttaa caagaaagtt- taaaaagtta aaataaaata 6360 

caaaaatttt taaatagaaa aaagctgccc aggaacaatg gctcacacct gtaatcccac 6420 

cattcgggga ggccaaggtg ggtggattgc ttgagctcag gagttcaaga ccagcctggg 6480 

caacatggtg aaaccccatc tctacaaaaa atacaaaaat tagccgggtg tggtggcatg 6540 

cggctatagt tccagctaat cgaggggctg aggtgggagg atcactgggg gggaggtggt 6600 

tgaggctgna gtgagctgtg attgnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6720 

nnnnatattc ttaaaaaaat ttttttttat ttttgagaca gaatttctct cttgttgccc 6780 

aggctggagt gcaatggcgc tatctcagct cagggcaacc tccacctcct gggttcaagc 6840 

gattctcctg ccttagcctc ccaggtacag gcgcccgcca ccatgctcgg ctaatttttg 6900 

tatttttagt agagatgggg tttcaccatg ttgtccaggc tggtcttgaa atcctgcctc 6960 

aggtgatcca cccccctcgg cctcccaaag tgctggaatt tacaggcgtg agccactgtg 7020 

cctggcctcc tttacatttt tttaaattta attttaattt tttaattttt aatttctcat 7080 

atatatatat ttttaagact agccaagtga agcagtggga gtggaaaagg aactggtttt 7140 

gatcaatagg tgtaaacacc actgcactgg gaccagccta ttttacattc ctgttagcag 7200 

tgatgagggt tcactttctt tgtagcctca acaatatgtg tcgttgccca tctttttttt 7260 

tttttttttt tttttttttg agatggagtc tcactctgtt gcctaggctg gaatgcaatg 7320 

gcatgatctc agctcactgc aacctccgcc tcccaggttc aagtgattct tgtgtctcag 7380 

cctcctgagt agatgggatt acaggcgtcc accaccacgc ccggctaatt ttttgtattt 7440 

tcagtagaga tggggtttca ccatgttggc caggttggtt tcgaactcct gacctcaagt 7500 

gatccgccca cctcggcctc ccaaagtgct gggattacag gcatgagcca ccgcgcccgg 7560 

cctgcccatc ttttttttgt tatagccatc ctagtggatg taaagttttt ttgtgatttt 7620 

gatttgtgtt tccctactga tcaatgatgt tgagcatctt ttcctgtgct tattggcttt 7680 

tggtatatct ttggagaaag gtctattcag gtcctttgcc cactttaaaa ttaggttatc 7740 

tttctattac tgagatgtaa gagttcttta tgttctagat ataagtctcc tacatatgat 7800 

ttgtaaaaat tttccttcca ttattgggtt gtctttcact ttcttttggt gtcctttagt 7860 

gcacaacagt ttttaatatt gaagtccaat tttctatttt tctcttttgc cacttgtatc 7920 

ttggtgtcat gtttaaggaa ctattgccta atctcaggtc acaaagattt acacctgtgt 7980 

ttccttcttt ccttccttcc ttccttcctt ccttctttcc ctccctccct ctctccctcc- 8040 

ctccctctct ccctccctcc ctccttccct tcctccctcc ctccctcctt ccttccttcc 8100 

ttccttcctt ccttccttcc ttccttcctt ccttcctttg tccttctgac ggaatcttgc 8160 

tctgtcaccc aggctggagt gtagtggcac' gatcttggct cactgcaacc tctgcctcct 8220 

gggttcaagc aattctcctg cctcagcctc ctgagtagct gggactacag gcacacacca 8280 

ccatgcccag ctaatttttg tatttttagt agagacgggg tttcaccaca ttggccagga 8340 

tggtttcgat ctcctgacct cgtgatccac ccgccttggc ctcccaaagt gctgggattg 8400 

caggtgtgag ccaccatgcc cggcctgtgt tttcttagag ttttgtagtt ttagctctta 8460 

tagttagatc cttgatccat tttgagttga ttttgtatat agtgtgagat atccacctgg 8520 

tgttgtaaat tgcccagaag tgggtatgct tctaaatctg gctgttaggg attactagag 8580 
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gtgaccaaag tgaatttttt ctttgtttct tttttttttt ggagacagag tctccgtcac 8640 

ccaggctgga gtgcaatggc ttcatcttgg ctcagtgcaa cctctgcctt ctggtttcaa 8700 

gcagttctcc tgcctcagac tcctgagtag ctggtattac aggcgtgtac caccatgctt 8760 

ggctaatttt tgtattttta gtaaagatgc agtttcacct gttggccagg cttttctgga 8820 

actcccggcc tcaagtgatc catctgcctc tacctcccaa agtgctggga ttacaggtgg 8880 

gagccaccgt gcccagtcct tttctcagaa tttatttgtt tttttttgtt ttgtttcatt 8940 

tttgagatag ggtctcactc tgtcagctag gcaggagttc agtggtgtga tcattgctgc 9000 

agccttgaac ttctggactc acgtgatctt cccacctcag cctcctgagt agctaggatt 9060 

acaggcatgt gcttccacac ctggctaatt ttttaatttt ctaggactta tttgtccatt 9120 

cttgcaaagc agggtacaac atgcctatct ctacctacct ctcttccctt caagggactc 9180 

cagccaaaat ccttgaggct ctcgggctga ctgtgggtgc tgttgcctga tctgcctcag 9240 

tcatgctgca tgatcaaaag tgtccgtttt ctgcttcttg gaactttatt cactttgggt 9300 

gtcagtcttc ctctgcagtg tcccaagaac acagaattag accaggaatc tgtgttgcca 9360 

tagtgtgtgg aaagaggcag acttccaact ccgctatgtg ctgttgggtg attgaagctt 9420 

aattttcttt ctatctttct ttcttttctt ttcttttttt ttttttggag atggaatctc 9480 

gctctgttgc ccaggctgga gtgcagtggt gcgatctcac ctcactgcaa cctccgcctc 9540 

ccaggttcaa gcgattctcc tgcctcagcc tcctgagtag ctgggattac aggtgcatgc 9600 

caccatgccc ggctaatttg tgtaatttta gtagaaacag tgtttcacca tattggtcag 9660 

gctggtctcg acctcctcac ctcaggtgat ccacccgcct tggcctccca aagtgtcggg 9720 

attacaggcg tgagccaccg tgcctggcac ttaattttct taatacctca attaccccat 9780 

atggtaaaat gggactagta atccatacct tatagcgctg ttgtgaaaat gaaatgaggg 9840 

taagcagata aaatttcaga ctacggatgg gattgttact acattctgaa cctggctttg 9900 

ctgttatttg ctatgtgacc ttatcttctc tggatctcca ttctttccaa gtctataaaa 9960 

caaagtggac aattgtcaac ctttcttcca aagagcaatg atttaaggat caaatgatgt 10020 

catttaacaa aaatatgaag agctcaacaa atgaggaact cattattatt attacaatta 10080 

ttattatttt agaaataggg tcttgttctc ttgcctaggc tggagtccag tggtataaac 10140 

acagctcaat gcatcttcag cctcctggat acaagtgatc ctcatgtctc atccccctaa 10200 

gtagctggga ccacaggcat gtaccaccac gcacggctaa ttttttattt tttattttta 10260 

ttttttgaga cagtcttgct ttgtcgccca gactggagtg cagcagcgca atcaccgctc 10320 

actgcaacct ccgcctcctg ggttcaagtg attctgctgc ctcaacctcc caagtagctg 10380 

ggattacagg cctgtgccac catgcccggc taattttttt gtatttttgg taaagacggg 10440 

gtttcaccat gttgcccagg ctgatctaga acccctggcc tcaagtgatc cccctttctt 10500 

ggcctcctaa agtgctagga ttacaggcgt gagcctctgc acctggcctc ggctaatttt 10560 

ttattttttg tagagacagg ttctcactat gttgccaggg ctggtcttga actcctgggc 10620 

tcaagtgatc ttcccacctc agcctcccaa agtgctgaga ttacagatgt gagccactgt 10680 

gcctggcctg gaactcatta ttgaagcatt cactagtatc aactttgggg ttacctggcc 10740 

acatcctctg acctacctat aagggtatca cagctaacgg agcctctgtt tctcagaatt 10800 

taggcagaag cagttcaatt tatcacaaac tactctatat ccagcataag tgcccaaata 10860 

aaacaattgc taaagttctt taggcattta ctgtttgtta gttagatatt tagtcctcac 10920 

tacaaatctg tgatacaggt attattttta ttaaccccat tttatagaag agaaacctga 10980 

agctcagaga tgctaagtaa cttgtgcaag gtcacacagc tagtaaataa agggcagagt 11040 

aaagatttag tttcacattg gactccagaa cctttctact gggactcatg ggaatagtgt 11100 

ggatgtccct gaccttcagt ggcccagggc tctcctgggg gaatccagcc atagacaaga 11160 

caccagcgag agcccaatcc taagattttg tttgtttgtt tttgagacaa ggtctcactc 11220 

tgtcaccaga ctggagtgca gtggcatgat caatgctcac tgcaaccttg atctcccagg 11280 

ctcaagcaat cctcccacct cagcctcctg agtagcttgg actacaggtg cacaccacca 11340 

cacctgacta attttaaaat tttatttaat taattactta ctattatttt ttgagacagg 11400 

gtatcacttt gtcacccaag ctggactgca atggtgtggt ctcagctcat tgcgtcctcc 11460 

acctcccagg ttcaagtgat cctcccacct cagcctctgg agttgcaggg actgcaggtg 11520 

tgcgccacta tgctcagcta atgtttttat tttttgtata gatggggtct cactatgttg 11580 

ccagggctag tctcaaactc ttggactcaa gcgatcctcc tgtcttggcc tcccaaagtg 11640 

ccgggattac aggcataaac caccacaccc aacccctaag gtgtttttgc tgaatgtgac 11700 

catgtcagag gcaggaaagg gaagcatcat ggggttagga aaggaacact gagcagggag 11760 

acaaagaaaa tgggatcatt ttgtgagtgt tcgctgtgtg tgtatgtgtg acaattctca 11820 

gagccagcct ctcaggtggt tgagaccaca gtccccattt cccagatgag ataatggagc 11880 

ctcagagagt ttctgcagca cagctagtgg aattagaatt tgaacccggc tcttccagac 1194 0 

tccaggtgct tcacaaccat cccaaaccta gtcatttgca gtttaccttc atgattttac 12000 

catttccctt tgccatagct agtgttattt acttaataat tccttttgaa tcagtctgct 12060 

taaaaaaaaa tagcttcatt ctaaagtgta atattcttgg aatatcgggt ttgctgttac 12120 

ccacccccac acgttataca tatacatgta tgtttctaat acatatatat gtacgtatat 12180 

acgtgtatcg ttttttgtta ttttttttgt tgttgttagt tttttttaga tggagtctct 12240 

ctctgtagcc caggctggag tgcagtggtg tgatttcggc tcactggaac ctctgcctcc 12300 

tgggttcaag cgattctcct gcctcagcct ctggagtagc tgggattaca ggcacccacc 12360 

actacacccg gctaatgttt gtatttttag tagagacagg gtttcaccat gttggccagg 12420 
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tgggtcttga actcctgatc tcaagtgatc 
ttataggtgc gagctactgc ggctggccaa 
aataactatg aaacctgaaa aactgctcca 
acatgctgtg cataccacat attgggaaac 
aatcactgtc ccatcctatg cttcgcttta 
atgccaaaga ccttttctgt tgtcattaac 
ttgggctggc tctcctttta gttcagaatt 
aagccttcct tatgaatgaa ctcgttggtc 
gaatgaatgt atgaattgac actaagtcat 
gtggaagagc acatgtcctt tctggcactg 
tggcccatgt aacacagagg ccctttggtg 
cttgagtttg aatctgagcc tgcctgtaag 
ggacccttac catttataaa atggggacca 
attaaatggg attctctata gaaaatattg 
tgggattgtt tgtgagggaa attgtcattt 
gtctaattca gggactgcaa atggaaatgc 
ggtaaagcag ggcaggtgta gaatagcaat 
gtatgacctt gagaaggcaa gccctggcaa 
tgcctagatc ttctaacttt ttatttttat 
tcacccaggc tggagtgcag tggcgcgatc 
tcacgccatt ctcctgcctc agcctcccaa 
gcccggctaa tttttttttt gtatttttag 
atggtctcga tctcctgact ttgtgattta 
acaggcttga gccaccgcac ctggccgatc 
tggattttta tgtgataact cctgatttta 
ataagggtca acattgccag cagagcaaaa 
gtgcagcctc tggtctaaaa tcatctttgt 
acagtgctgt tgtatagtaa atatccagta 
ataaacctgt agagaggccg ggcacagttg 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
gggtagatca cctgaggtca ggagttcaag 
ctctacaaaa ataaaaaaat tatctgggca 
cgggaggctg aggccggaga attgcttgaa 
atcatgccat tgcactccag cctaggtgac 
aaaaaaaaag aaaaaagaaa agaaaaagaa 
attggttcct gagagtagcc aaccttattg 
cctggcttgg gccaatgaat gaggaagaat 
tccaccacta gggaggtctc aagcttgaaa 
tggccagtcc tcctacagtc aagtagataa 
atagtgatca aggccgttac tctgcaatca 
catttgtggg tcttttggaa gagcagagaa 
gggctgcctg tccctgctcc ctgagccatg 
ctgtactttt tttttttttt ttttttgaga 
tgccgtggcg tgatctctgc tcacagcagc 
acctcagtct cctgagtagc tgggattaca 
gtatttttag tagagacagg gtttcactgt 
tcaggtgatc cacccgcaca tgcctcccaa 
gcccggtcct accaatctgc acattttaat 
ggcatagttc tgagaagcat cccacaagga 
agctcttggc agagacatct ggcaaattca 
gtgcagcaaa ttttgtctag agcttggtca 
atttcaattt ttaagaaatg ttgcattaaa 
agtgtcccct taatttctta gtgtgtctga 
cagaccctgg ggtgacagaa gactggtggg 
ggagaggatg atctacagag tgctggagag 
ttgccagccc tgaagctatg ccttgaggtt 
cctcaactgg ggccattttt ctccccagag 
atcatctgcg ggggtgggga gaggggctac 
cattaaactt tctacaacgc ccaggacagc 
catattaata gcacaaagtt gaggaacctt 
gtagattttt cacacattaa agatgagaaa 
ggctgtggca ggaggattac ttgagcccag 
agcactgcac tccagcctgg gtgacagagc 



cacctgcttt ggcttcccaa agtgctggga 12480 

tgtatgtttt taatacacat tcaaataacg 12540 

tgttacttcc tgaacccatc ttgagtgctc 12600 

actgctttcc ctggcttcca agcccagctt 12660 

tttgtctata aatgttgggg ttgggggttg 12720 

atggacacag ctctaagagg tcttggcatc 12780 

tggattttta tccaactact cagagtgatc 12840 

aaactcataa aaggctgatc gataaaacag 12900 

tagcatttca cgggaatgga ttctccgtta 12960 

atgtgtgctt gggaaactta ctgagctaac 13020 

cagtggaaaa ctgttgactt tggagattat 13080 

aagctggcta actgaattgc tttgcttctt 13140 

ttgtactcac cctttagggt tattgcatgg 13200 

gcacaaagta ggtgtaaatt tgcacgctag 13260 

gattatcaaa gacttaggag caggaacagt 13320 

cagctgaggc caggcatttg ctaataattg 13380 

gtctgggaat taaaagagag gtgaggacgt 13440 

aaggggatgg cctccactca gctacagtca 13500 

ttttattttt tgagacggag tcttgctctg 13560 

tcggctcact gcaagctccg cctcccgggt 13620 

gtagctggga ctacgggcgc ccaccaccat 13680 

tagagatggg gtttcaccgt gttagccagg 13740 

ccctccttgg cctcccaaag tgctgggatt 13800 

ttctaacttt ttaaagagaa gcaagacatc 13860 

aactggcacc caattataat ttacaacact 13920 

catgggtggg ggcaactgct ggtcaccggt 13980 

atttcttctt- gctttacgca ttgtcccagc 14040 

agtgggtgta gaatgaataa accaatgcag 14100 

ctcatgtctg taatctcagc acnnnnnnnn 14160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 14220 

nntagtccca gcactttggg aggccaaggt 14280 

accagcctgg ccaatatagt gaaaccccgt 14340 

tgattgcagg tgcctctaat cccagctact 14400 

cctgggaggc ggaggttgta gtgagccgag . 14460 

ggagcaagat tctgtctcaa aaaaaaaaaa 14520 

acaatgaatg agtgtgaggc tcatggtagt 14580 

gtcatcccag ccacgaagtg aaatggtacc 14640 

aatggcaaat gggggtctat gcctccaccc 14700 

tccagtgacc aggtttttag gtcctggacc 14760 

gtggagggtt tggtccgttg ggctacggag 14820 

gactcagaaa tggcctctca gttacttctc 14880 

gaggaaggaa tttaggtctt ctcaccctct 14940 

gagggctggg gtggaatatg gggaataaat 15000 

cagagtctcg ctccgtcgcc caggctggag 15060 

atctgcctcc cgggttcaag ttattcttcc 15120 

ggtgcccacc accacgcccg gctaattttt 15180 

gttgggcagg ctggtctcaa atacctgacc 15240 

agtgctggaa ttacaggcat gagccaccgt 15300 

tgacaagggt caccctccac tcatgtgcca 15360 

tgcctctgag ttcaccctga caagtccact 15420 

aggcttgaga catgctggcc tctctttaaa 15480 

gttaaaattt tgatgttttg ttttgcatta 15540 

atgttattta tcttgaatag taaatttctt 15600 

gttgagagcc tcccctgcct gattctagtc 15660 

agatgggagg tgaggagggg agtgttggtt 15720 

actctgtatg gagcttttca tgctgcctgt 15780 

gggcaaggtg gcatatccta gatcagagat 15840 

gacatttgga aacatgtgga gacatttttg 15900 

tgacatctgg tgagtagaga ccagagggac 15960 

ccctccacaa taaagagtta tttgacctca 16020 

gatctagatc cacagcacag aagaaaggat 16080 

gcttgtgcct gtaatccctg tgactcagga 16140 

gaattcaggg ttacagtgaa ctatcatcgc 16200 

aagattttgt ctcttaaaaa aaaaaaagat 16260 
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gaggacaggc acagtggctc atgcctgtaa tcccagcatt ttgggaggcc gaagtgggtg 16320 

gatcacgagg tcaggagttc aagaccagcc tggccagcat agtgaaaccc catctctact 16380 

aaaaatacaa aaaattagcc agctacttgg gaggctgagg caggagaagc gcttgaaccc 16440 

gggaggtgga gcttgcagtg agccaaaatc ttgccattgc actccagcct gggcgacaga 16500 

gcaagactcc gtctcaaaaa gaaaaaaaaa aaagatgaga aagaggaagg gagagaaaaa 16560 

agagagagag gaaagaaaga gagaaggttt tggagtcaaa aagacttaga aattccagtt 16620 

cttccacttc ccatggaacc ttggcaagtt gccttctctc tttctctgaa tctcacattt 16680 

tgcctctgtg aagtaggggt ggtacctggt ggagatgatg cggagatgag ggtgaggggt 16740 

gtgttgcaca ctatgcccct aggatgggtg agagcttggg agcactgaac ctccctttcc 16800 

cctcttgttt cttcccccca ttgtctccca ccagctccct gggatctcca cttcactctc 16860 

tgggattcca ccagcaggag gctactcctg gagttaaggc gtgttgttca gactggggca 16920 

ttttaggggg cataaataat aattatgcct ggacaatgga cataacatct agggccttct 16980 

gaagcaaacc agggtgtggg gtacccaaac aaggcagtag gccccaggag gcaggtccct 17040 

gcagtcccag cagagagcag ggcacagggt tgagaagact gagcaaactt cattatcagc 17100 

tcctttgtcc cccactctgt cctggagcaa tcattctggc ctcttcccac ttccccaaaa 17160 

acccagtata aaggctgctt ctggcccctg aagccagagg cactgagagt ggaggtctca 17220 

gactcttgga aggtgagttc ttttctggct gcccaggcag gaccagtgta ggccctggga 17280 

agaagcagca cctcataggg caaacacgta ggaggcctgt ccttaggaac atcatagcta 17340 

agcagacctg tccccgcagg ggcaggagtc tgggctaagg gtgatactgg agagcagcaa 17400 

cggagactgg aagacaaatg aaatttggta cctgagttat ccctcccacc attccttttc 17460 

tagactctcc agctcagggt ctgttcatgg caagaggaga aagcaatctt gtttgctctt 17520 

taatcaaaca attaaacaaa tattccctct atactatgtg ccaggggcta tactagacac 17580 

acaaagacag ccccaagaag gacggtggag tagtgtcctc gctaaaagac agtagatatg 17640 

caatgcctct tgctcctgcc ctttctcctg ctgggaacag tttctgctct tcatctgggt 17700 

aagtctctcc cttccctcct catgcgtctt tccctttttt cctttttcct acactcccct 17760 

ccccccgctt ttatttgcac tcatgaggcc aggaccacag ccttccctct ttagctgata 17820 

cagctcatct ccggtaagat atcacttgga ctcagaactg • taacctggaa ctttctcttt 17880 

tttgtttgat ttttttttgt tgttgttgtt tttgtttttt tttttgtttg ttttttgttt 17940 

tgttttgaga cggagtctcg ctctgttgcc caggctggag tgcagtggcg cgatctcggc 18000 

tcaccacaaa ctccgcctcc cgggttcaag caattcttct gcctcagcct cctgagtagc 18060 

tgggactaca ggcacatgcc accacgcctg gctaatcttt gtatttttag tagagatggg 18120 

gtttcaccat atttgccagg ctggtctcaa actcctaacc ttgtgattcg cccgccccgg 18180 

cctcccaaag tgctgggatt acaggcgtga gccaccgcac ccggcaaact gtaacctgaa 18240 

ctttcagaag gaaaaaccac ccacctgtta agatgaaggg ctggtgactg ccccaggctt 18300 

ctcacacgtg ctttctccca ccttcaaaac acacactcgt ggtgtcggcc agaagtcagg 18360 

ttcttgtcca tttgtgggtg tgacccgaga gatctctcct tacctaacac caaggaaatc 18420 

ctccagtctt gtcttcaggt ggaattccta ggaaagctcg agcgacgttg ctggagctgt 18480 

ccacggtgct ggaactagga agctcttgac ctgatggcag gttacctctt cttcccagag 18540 

aatgatgccc cccatctgga gagcctagag acacaggcag acctaggcca ggatctggat 18600 

agttcaaagg agcaggagag agacttggct ctgacggagg aggtgattca ggcagaggga 18660 

gaggaggtca aggcttctgc ctgtcaagac aactttgagg atgaggaagc catggagtcg 18720 

gacccagctg ccttagacaa ggacttccag tgccccaggg aagaagacat tgttgaagtg 18780 

cagggaagtc caaggtgcaa gatctgccgc tacctattgg tgcggactcc taaaactttt 18840 

gcagaagctc aggtaagtag tagggaggct actgcggagg acctggggga aaagagagta 18900 

cattcagtct tctgttccct attcatttag gctagtggtt ctcaaagcct cgcatgcatc 18960 

agaatcacct ggagttgttg ttaaaacaca gctttctggg cctcacctgc acgacttctg 19020 

atttaggagg gctgaggtga agcctgagaa tttgcattta caacaaatcc ccaggtgatg 19080 

atgatattgt tggtctgggg agaaccaccg atttaaacaa aaggctttgg tgttagaaac 19140 

gcctgtgtta aattctggtt ctgcctttta ttagctgtgt tacctgggca agttgctttg 19200 

cctttcaaag ctttagcacc ttcatttgta aaacgaagat atatagcacc aacttcttag 19260 

agttgtggtg agcattaaat gagataatac atgaaaagtg tttggaatag tcactgggct 19320 

gtaataaact ctcaataagc ggtggttata attattatga gtattatcat ttcctgtagg 19380 

attgtcctga cagctaatta agaagcaaaa gataggatta agggaggcaa gtaggtttat 19440 

ttttaacctg aaaagggatg ccgggctctt gcctggagac tcagaaactt gaaataaatg 19500 

agagggaatt cnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 19560 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngaattctct 19620 

gttagcacat agccagaaca tctagaaggg gtggtaggag tggggattag aggttccagc 19680 

tggaggcaat ggcacttgca aaggctttgt tgaagtggcg taagtgtgga ggtggagcat 19740 

tcaggaaagg agagcttcag cttcagtgtg gctggagtgc tgggtgtgaa gagaggtgaa 19800 

gatgaggctt ggaggctggg cagattttgc tccaaaagag cttggtgaac tgtgataagg 19860 

agtttggatt ttctcctact aaggacaaca gcaaactatt gaagagttta aatcgttcag 19920 

tgacaatgac acgtttgcgt tttggtggct cactcgagct gccagccagg tagacagtgg 19980 

cagaagatgg aagataaagc actaaagggt gatgaggcag gaagccagtg aggagagaaa 20040 

ggggacgatg tgagtgacag taaatcattt gttgggttgc tattgtgtgc taagctctgt 20100 
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gctaaattct tcacgtgtat tatttcagct aatccatcta acaactctgt aaggcaggta 20160 

caatcgttcc cagctgaaga agctgaggct ctcaaaagct agtaacttgc ctaagttcat 20220 

gcagcatgca agttgtccag ccaggattct aacttagaca ccagaggcca cttttaacca 20280 

ctgctctagg actgggggaa atggtcccta gtgagatatg tgtcgagttt catatttcat 20340 

tcaacaatat tgttggcctg ctacatgtga agagctgtgg aaagcgccca aagtgagtta 20400 

gatccctatg agcaagtggg atgggggtgg agtggacagt aggagggctg gaacacacat 20460 

aaaagggtat aagaaataac aattaggccg gccaggggtg gtggctcacg cttttaatcc 20520 

cagcactttg ggaggccgag gagggtggat cacttgaggc caggagtttg agaccagccc 20580 

ggccaacatg gtaaaacccc atctctacta aaaatgcaaa aattagctgg gctggtggtg 20640 

cacgcctgta atcccagcta cttgggaggc tgaggcacga gaatc^cttg aacccaggag 20700 

gcagaggtta cagtgaactg agattgcacc actctactcc agcctgggag acagagtgtg 20760 

accctgtctc aaaaaaagaa aacaaaacaa gtaggtactt tctgccatag ggaggattca 20820 

taaactgcta gtcctcaggt gcatttttgc ttatcagttt taaaaatcag agaatgtctc 20880 

aaagaattag gatgtcagct tcttttgaaa atttgggcca gaagcggtgg ctcacgcctg 20940 

taatcccagc actttgggag gctgaggtgg gtagatcacc cgaggtcagg agttggagac 21000 

cagcctgacc aacatggcga aaccccgtat ctactaaaaa tacaaaaatt agctgggctg 21060 

gtggtgcatg cctttagttc cagctactca ggatgctgag gcatgagaat cacttgaacc 21120 

cgggaggcag gggttacagt gaaatgagat tgcaccactg cactctagcc tgggagacag 21180 

agcaagaccc tgcctcgaaa aaaagaaaaa gaaaatttgg aagatctgac aacagttgac 21240 

ctgcattcct gctcggcaac agcctgatgg tggatgggca gaggctcagt tgtctgccaa 21300 

acctcccatc actgatgtct tccctcgctg tcatcatctg cttgacatgt aggcatttgg 21360 

tgtgtgcctt ctgctctggg tgcccagatg aattggatgc tatatgagaa aacattctgt 21420 

aaatgtcttg tggtaggcaa cctcaaagat cactggggcc tccaatgatc cctccttcct 21480 

ggtattcatg cctgtgtata atcctctccc ttgagtgtgt actacacctg gatacttgct 21540 

tctaataaac agaacacagc aagggtaatg ggatgctact tctaaggtta aattacaaga 21600 

gtgtaaagtc tgtcttgttt gtttccctct cttgatcttc ctctcattct ctctctctcc 21660 

ctctctctca ctttcttact gtcttgtcct tccctttgtt' tactctgatg aagcaagcta 21720 

gcaagcatcc atgttgtgag ctgacctatg aagaggccca tgtggtggta aggaactgag 21780 

ggcagcctct acccagcaag gaactgagtc actcatcata tgggtgagct tggagacaaa 21840 

tccttcccca cttgagcttt cagatgacgg cagccctggc tgatgctttg caggcttgtg 21900 

agagaccctg agacagaaca ctcagctaag ctatacccta tctcctgaga tagagtataa 21960 

tacatgtagt tttaagctac tatgttttgg gataatttgt tactcagcaa tagataacca 22020 

atacatatac catgtacata actgtttcag ttgtctgaga ctatatttag tcattttaca 22080 

cctacatcaa gaatgtgtca ggcaccattc caggtacttg gaatacatca attaacagaa 22140 

taggtaaaga ggccaggcat agggctcaca tctataatcc cagcactttg ggaggcccag 22200 

gtgggaggac tgcttgagcc caggagttga gaccagcctg ggtaaaatag tgagacactg 22260 

tctcaactaa aaaaaaaaaa aattagttgg gcacagtggc acatgcctgt ggtgccagct 22320 

gctcaggagg ctgaggtggg aagatcgctt gagcccagga gtttgaagct ccagtgagcc 22380 

acggtcacaa aactgcactc tagcctgagc aacagaaaaa gaccctgtct caattaaaaa 22440 

aaaaaaaaaa aaaaggaaag aaagaaaaaa ataggtaaag atccttgatt cttgccctct 22500 

tggaacttct attctagagg gggatggttt ttcacagtag aagtctgtgt tgacagcgct 22560 

gtttaaagct ccttcagcat ctggggaaaa ggttnnnnnn nnnnnnnnnn nnnnnnnnnn 22620 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 22680 

nnnnnnnnnn nnnnattttt tagagatagg gtcttgctat gttgcccacc aggctggtct 22740 

tgaactcctg ggctcaagca atcctcctgc ctcagcctcc tgagtagctg ggaatacagg 22800 

tgtgcaccac catgcctggc ttatttcata tatatatatt tttatatata tgtatattta 22860 

tatatataaa tatatatata atttctgtat ataaataaat aaatatatat atatatatat 22920 

ttttagagat agggtcttgc tatgttgacc accaggtctt gaactcctgg gctcaagtga 22980 

tcctcctacc tctgcctttc aaagtgttgg gattacaggc gtgagccatg gcacctaact 23040 

gagttatttt taccacacga agcataggac atacatccaa aaatgttctg agctgagcaa 23100 

gagcctggag gcaagtgaat ctgaactttc ccgtctttga agaaaccagt ctctctccaa 23160 

agtcacatag ttagtgtcac tccccccaag aactgcatga gctgggacaa tcagagggca 23220 

gtggaaggtc tggggctcag gggcgccccc tgctgtctcc ccagggtctg tccccttacg 23280 

caagagcctc tgctccccca ctttcctgtg gagcctcctc accatgggca tgacccagct 23340 

gcggatcatc ttctacatgg ctgctgtgaa caagatgctg gagtaccttg tgactggtgg 23400 

ccaggagcat ggtgaggcac cgctgaggcc cctgggggtt gggggcacag gcgggtcacc 23460 

ctggctgagc tcccctcacc atacgtttcc ctacccacag agacaaatga acagcaacaa 23520 

aaggtggcag agacaggtag ggctatgaaa gcagggccct ggctcacgcc caccccactg 23580 

caacccgctt ctcagggggc gggactcctc taggcctggg cccacccagg taaccctttt 23640 

gtgggatgta agagtctggg ttcagaggaa ggctattttg gtgctctctg gcctccgctg 23700 

gaaggggtga tagtgtccac tgagtgccag ttcctgaccc cactgccctt cccatcctgc 23760 

ccagttgggt tctactcctc cgtcttcggg gccatgcagc" tgttgtgcct tctcacctgc 23820 

cccctcattg gctacatcat ggactggcgg atcaaggact gcgtggacgc cccaactcag 23880 

ggcactgtcc tcggagatgc caggtgacct gcctgtacag ggatggtgac agcaagtggt 23940 
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caggcagtgc ttttcatttt ctctgtgcgt ttacatccag cagcttgttg ctttctccca 24000 

agaaccctag gagatcaggg gtacctcccc attttacaga tgaggaaact gaggctagga 24060 

agggacctgg cttgcttaat aataagaata gctaatgcag agtgctgact gtgcacttgg 24120 

caccttgcct tgtttagtcc tacaacacct ctttgaggta gatgcgttaa tatcttcatt 24180 

ttgcagttga ggaaaccgag gtacagggtt gcacagttag gtcattcacc caagatcaca 24240 

cagctttcag tggcagcctc cagaacctgt gttataaggg tacacgctaa agtcttgtta 24300 

gggctagaat aggtagagtt ggtatattag atatttattg ctgtataaca aatcacccca 24360 

aggcttggca ttttaaaaca acaaacactt ctcatctcat acagtttctg acagtcagaa 24420 

atcagggaga gactcagccg gctgattctg agtcacagtc tctcatgaag acatagtcag 24480 

gctgtcagcc agggctgcag tcatctgaag ggctgactgg ggttggagaa tctatgtcag 24540 

ttcaattacc cccatggcct ctccataggg ctgctcagga cacagcacct gctttccctt 24600 

gagcaagagg gctaagcgac agagaccccg tatcttctct cacataatct cagacgtagc 24660 

ataccatcac ttctgttacg ttctattata ggcacagagc aaccctgata tactgtggaa 24720 

ggagactgga caaagcaggg gaataccagg aggcaggatc cttgagggct gtcttgttgg 24780 

ctggagacca ccattgaggg tttttttttt tttttttatt gagacagtct tgctctgtcg 24840 

cccaggctgg agtgcagtgg cacgatctca gctcactgca acctctgcct cccaggttca 24900 

agcgattctc ctgcctcagc ctcccgagta gctgggattc accatggagt cttgaaccca 24960 

gattctgtga ctgcttttgc tctttttgtg ttcatccaaa cagtccctgt ttatcctaag 25020 

aggatgggag aaagagactg ggagagaagg aaatccagtg gcctccctcc ctgctagcag 25080 

agcctggccc tggcactgag ccttcctcct ctaccctctg ctcctaatgg tgagggtccc 25140 

ctagcagggc ccttctgtcc aggacacatg ggccgcctgt cctcacccca gcctactgac 25200 

ctctctcctg ggctggcctc agtgcccttg attgtgccgg agagaggaag cgctggacag 25260 

tcaggccaag ctgctgtccc caggagggca tctgcttatg tctagggcag ggacaccttc 25320 

ctgaggactt ctgatgagag acggtgtgag agcttcccac ttcccacctt ccttcccatc 25380 

cttggttctc aaaccttcaa gtgtgcatga gaatcactta gtgggggata tttgtccaaa 25440 

tgcagatttg cagatatccc cgctgagatt ctgagggccg agatgaggcc tgtgaatctg 25500 

catgttaaga aagcacccgc tttgatgcgt gtgtcattgg- gtaggggagc aacactttga 25560 

gaaacatgga gctagagaac gtgggtttct atgggtttcc catagaaaca tggatttctg 25620 

tgttttctgc tgccctgaca tcgaaggcac atctgaaggg ggaggggcca ggccaagaac 25680 

cagggagtcc tgggaacgta gaggcagcag ccagtgactt cccgtactcc tcagggacgg 25740 

ggttgctacc aaatccatca gaccacgcta ctgcaagatc caaaagctca ccaatgccat 25800 

cagtgccttc accctgacca acctgctgct tgtgggtttt ggcatcacct gtctcatcaa 25860 

caacttacac ctccaggtac ccaccttcat ccttcccctc tccctgcctc ccgaggctcc 25920 

tccaaaggga tggtccatcc agcacctgcc ttccaggaag cgcagttctg gtcttctgat 25980 

ctggatctat tttccgggtt ctccaggaag tgtttctagt agattgggtt ggcgaggggg 26040 

tgggaattga ggcccagttg gcctcttcgc cctacccctc cttcctccag cctccacaca 26100 

ctctcctaac ctcttcactc tctctttttg gttttagttt gtgacctttg tcctgcacac 26160 

cattgttcga ggtttcttcc actcagcctg tgggagtctc tatgctgcag tgtgagtctg 26220 

ttgggctgaa atgccttcct gagctttgca accgtgatca gagaacccca gggaagggtt 26280 

gggagggccc caggcatccc ctaatgcacc tctctctgag accctctgat ggcagggagc 26340 

tcacttcctt aaaggcagcc tatcctgctg taattgactc cccctgttgg agtcttccct 26400 

tagaggaagc tgaaatacct ggcttgatga cactttggtt ctatgtctgc tgtttgaaac 26460 

ggcccccaga atggcctccc ctccatgccc accctgaaga aatttcccaa gggcagccat 26520 

ttgccttata attttcctct tcatgttgga cagtccccac ttgcatctct ctcctggttt 26580 

cccctgctgg gcgctgctga gggactctcc cctgtgtatg tgatggagta acaggacatt 26640 

acaataatga tgacaaaatg acaaccatta tcaagtgctc cgttggtgca ggcagcaggc 26700 

aggatccttg accatcactc cctgagttca gcctcactgc agcggtctcg gcagagggca 26760 

gctctctttc cttcatctgc tcaagccaga accctggagt ttccttgatg tttctctccc 26820 

tcacactcca tgttcactcc atcctcagta cagccagcag cagcttctac acaccccaaa 26880 

tctgaccctt cttgtcacct ccactgctgc ctctccagtc ctagccacca acatctctag 26940 

cctggattat tgtggcagcc tttagtctcc cacatctgcc ctggccccgc tgtctcagtc 27000 

tatttttaac acaggggctg cagtcacctg tcaggacata agtctcttca catcactctg 27060 

tggtgtcctg tctcatctgt ctcagagtaa aagccaaagg ctttactatg gcctaaaaag 27120 

ccctgcaagc tctggcccca gcacttcact cccctctagc tccccctcct ccattgttca 27180 

ctctgccaca gccacagtgc ttcctagtgc tccggaagtc tcaagtgtgt tccctgcttg 27240 

gcatctttgc atgtactagt ccctgtttct agaacattct tctccagata tctgcaaggt 27300 

gcccaatctt accttctctc cttcttcagg tctttccctg actgtcctct tctcagtgag 27360 

gcctcccttg gctgtcccat gtacaattgc aacctcccta ctgcccgctt ctctgcttgg 27420 

tttttctcag cgtttatcac taacactctg cctatctctt gcttattgtc tgaccgccac 27480 

ctgctccatg ggaatgccac ctcctcgatg gcaggaatct gttgacttgc ttgatcgtgg 27540 

tatctccagc acctagagca gtgcctggca catagtaggt tctcagctaa atgtttgttg 27600 

acagaataca gtggacagtc ctgcgaggtc aatgccatcc ctgttattag tggaggaagt 27660 

ggggctcagg gagtttgagc cacttgccaa tatcacacat acaggaggtg tgagaaccca 27720 

gctcagtggc cctgaagttg gagcatttgc cctcaaggct ggggaccaaa gagcccatgc 27780 
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aaagagcccg aacgcttaag caccaccctg cctggccagc ggggnnnnnn nnnnnnnnnn 27840 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27900 

nnnnnnnnnn nnnnnnnnnn nnnncccact gcgcctggcc cattactttt aatggcaaaa 27960 

accacaatta cttttgcacc cacataaata gttaccatgg gctgagcatg gtggctcagg 28020 

cctgcaatcc cagcactttg ggaggctgag ccaggcggat cacttgaggc caggagttca 28080 

agaccagcct ggccaacatg gtgaaacccc gtctccacta aaaaatacaa aaattagctg 28140 

ggtgtggtgg cgcgtgcctg taatcccagc tattcaggag gcagaggttg cagttcactg 28200 

aaatcatgcc actgcactcc agcctgggcg acagaatgag actctgtctc aaaaataaat 28260 

aaataaataa ataaatattt accatgtttt gaccacctgt tatgtgccaa ctgtattact 28320 

taaaaacacc catgggaggc tgggcacagt ggctcacgcc tgtaatcgga cactttggaa 28380 

gggcaagcgg ggaggatccc ttaaggccag gagttcaaaa ccagcctagg taacacagta 28440 

agccctgtct ctacaaaaaa taaaaaaatt aactgggcat ggtggtgtgt gcctgtaacc 28500 

ccagctcctc gggaggcaga gggagaggtt cgcttgagcc cagcagtttt aggttgcagt 28560 

gagccaggac caagacacta cactccagcc tgagtgacag agcaagacac tgcctctaaa 28620 

caaacaaaca aacaaaagcg acctgtgggt aggtaggaac aggctcatag tacagatgag 28680 

aaagcagagc ttggagggct caagcgattt gccaagcaga ggtccaagcc gaggtctctc 28740 

tgaatccaaa gttaattccg tctatcatat caccacagcc ctctctgccc cagggagagt 28800 

ctctgcccac tccagccact cacgtgtaat tgacttcctc aggggcagga aaggcttcga 28860 

tgggccagtt gagggtgcag ttcagaaaga taaggcaggc caggccagac caggtgaaca 28920 

tgatgaccac gaaggccaca ccggcatcgt agatcagctg tgagaggagg gggcaggccc 28980 

gtgggggaga ctgcctggcc ccagacccca ccaaggtaga tcccaggcct cagaggcctt 29040 

aaagaagttc tcttctcccc ttgtccttgt gcccaatttg cagatgagga aaccaagacc 29100 

agaagtttag agtcagactc agaagaccca tcattccttt ttctttttca cttgaggccc 29160 

cctagagagc tatgaaatag tctccacaaa gcctgaagtt gctggccact ggctcaaaat 29220 

atctctgaaa tttccattat cttaaaaaaa tacatacatt tttgcctatg actccacaaa 29280 

cattcatgtt catgttcgca caaaaatgtc catttcatag tacgtacaaa ggaaacttag 29340 

tgctctaggt ttaccgggcc taatcgtgtt tatcctgccc- cttcctggca cattccccag 29400 

gggaaaaggc aaacccagac tgctcatgct cagccttttc tcacctttcc caggtcctcc 29460 

cacgtgcaac aactgggggg gttggggaga gggaggtgca' agtgctctgc ccaagggctc 29520 

tcaaccccag ggcaggtaag ttctcaattg aatgagattc tgtgcaaatg tgtcagccct 29580 

tcttatggaa gaagctgatg caccatctgt cctcttgtcc tccccatacc atctgaccag 29640 

gataattaat gtctgctctc ccctcaggct cctgctcaaa cctttttctc tgcagtcttg 29700 

gaccttggtg ccttttcctc cctaggggca ggacagagct tcaaagggcc acacccccaa 29760 

atgtgtggag gtaagatctg gctcttcaaa cactacttca gttgaaaaga agggagaact 29820 

gcccaccctc catgcctgcc caccagaaca actgatggcc cccccaccca tgcgctctct 29880 

caaactcctt tggagacact gagcaaaagt accttcttta gtactctttg taaagtgcaa 29940 

aacggtatgc agtttggtac tgcccaccgt ggaggttgag gagcatggca tggctcaaag 30000 

ggtcctttga tatttgacag aggaaattga ggcccccatc ttgcactgag ctaaaacttt 30060 

ggtcccctgg cttcgaggta caccaggttg acctgtccag gatccagcct ggcataaact 30120 

cactttgtga ccttggacca aaccacccat cctctctgga aggtgtggaa aaatgtggcc 30180 

ccaaaggctg aataaagcca gagagtcagg gaccttgaac gcatgtgaag gggctggact 30240 

tgattctgta ggtgaagcta aaccactgaa ggtttttcag cagtgtgtga gccagttccc 30300 

catctgagat ctttctggaa gtcacgtgag tgacagagta cagagaaaaa gaatcagagg 30360 

cagggagacc agctgagaaa gcttgctgtg gcccaggaga gagggggaag gcctgcattg 30420 

ggatgatgac agagaaagga gagcggagaa gtcagacccg tgggtcagca ctagctgctg 30480 

ctcactcggc cccacccggt tcttgtgtca agacaaaaag aaaacccagg tggcctcata 30540 

ccttgattcc tgggaacgta atggcagaag aggcgtaaga gccaatcatg agggccatta 30600 

acgtggagcg caggttccca aacatgttgg gcagctgagg agggaaagca gcacccatga 30660 

ggtggggaca ccgtgaccct tgcccagcat tcccagccct gctccataca atagctccag 30720 

gagacgcagc agaaaagccc caaggtaaaa caaacagaaa aatcaatgtg ggaaactgta 30780 

ctctgccccc tgcctacaca gtcacagtgc cctttagctt caaaaaggct cccagacacc 30840 

cctcagagag acattttgtt aattttgttt aattccaggt ttcccaagtt tgttacgtaa 30900 

cacctctgaa aaacacatgg aataggtgct taagaaacac tgatcttggc tgggcgcagt 30960 

ggctcatgcc tgtaatccca gcactttggg aagccgaagc tggtgggaag cttgaggtca 31020 

ggagttcaag accagcctgg acaacatggt gaaaccccat ctccaccaaa aatacaaaaa 31080 

ttagctaggc atggtggcat gcgcctgtaa tcccacctac tccagaggct gaggcaggag 31140 

aatcgcttga acctgggagg tggaggttgc agtgagccga gatcgcacca ctgcacttta 31200 

gcctgggtga cagagcgaga ctatgtcccc accccccaaa aaaaaagaaa agaaaagaaa 31260 

gaaacagtga tcttgtccaa cccatttgag atgagacaat tgagacccag ggaggaaaag 31320 

tgtactcaag ttcacagagc acattaatgg ctttctcccc attgtcgttg tcccagccct 31380 

aacccaaggc tgtgaccatg gctgtgtccc ggtaataggc agtgcctctt aaccctctcg 31440 

gttgacgtcc cagcccagtt tctgcctaat caggacaaat cacatcctgg gaggtgaggg 31500 

tggaaataag ggagggaact gagccagggc agacagtctc cagaggaggt ggctctgacg 31560 

cagagcaggg tcagaaccca caccaggaga gaatttaatt gatcatgtgt tccactcacc 31620 
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tgcctcagcc aagccctcag ggcaggggaa ggcaaagtca ggatgccctt cgcacacacc 31680 

ctcctctggc cccaccatcc tccccaagtc actagatccc acagctgaga aggaccttag 31740 

gatccgtaca aagcctaaac acactccaca gagggggaaa ctgagactct gaagggaggc 31800 

ctcaacagct ctggtaaaaa aggcgtttag gccgggcgca gtggctcaca cctgtaatcc 31860 

cagcactttg ggaggccgag gcgggtggat tgcctgagct caggagttcg cgaccagcct 31920 

gagcaacacg gtgaaacccc gtctccacta aaatacgaaa aaattagccg ggcgtggagg 31980 

cgtgcacctg tagtcccagc tactcgggag gctgaggcag gagaattgct tgaacctagg 32040 

aggcagaggt tgcagtgagc cgagatcgcg ccactgcact ccagcctggg cgacactgcg 32100 

agactccgtc tcaacaacaa aaaaaaaaaa atggtgttta aacacatata actaaattat 32160 

ccttccccct tcccctgaag tggctggctc aggaaaaacc tctacccact caggcagagg 32220 

ttttcctgca ccctgcatcc gtgaggcacc actgccaagg acgccaggga aggctgccag 32280 

gcctggagag gggcagggcc ccctcccctc caaggggcca caaacgctgt ctgcgcccag 32340 

taccgtgggt aaggcgaggc cggccggcta accccgggct ggcggccttg cagcgtgcgt 32400 

ggcaacagca gctgggcccg caagactcag cacgggacgt cctcgtccaa gtctgggcca 324 60 

agagcagcgg cccagggggc ggggccggcc agagggagcg gggagaggct gaggggcggt 32520 

gccagcgccg gaccctgcca ttggctggag attacaggag gcggggacat agcagggagg 32580 

agccgctgga caagccccac ccggccgcca gggagggtct gaggtcaaga gccggagaga 32640 

agggatttag ggccctgggc caagttgcac agcagggaga aggggctgcg cagaggggcg 32700 

gggagaaagg gatccgcttc cttcctttag agctgtgaaa tgtccccggt tggaattaaa 32760 

ggcggctgct ggggagaggt gaaattcagc caaaaccacc cagtcaggca gcccttctca 32820 

gagataaaca gtccgagcca gcccggccag gaaccttccc ctccaacctc cctaagcctt 32880 

taacactcct aagcctttaa cgcgtttaca cactcacata aataaacaca ctttgagcaa 32940 

cacacataca ccactcacca catgtaatag gtcaagccat gtgcacgacg aggtgtcgac 33000 

aatttcatat ggttcaacct agtacactca caaacacacc taccaactca tggctttcac 33060 

agggacgggg tcacacaccc actctcccac gacatggcaa gcgtgcacac gctatctcaa 33120 

gctgctccct ccccctcaag atcatgttac ccagttttat tttcttccca gcacctatga 33180 

cgactgacat aatttattag tttacttgtt tattgggtta • tctgtgcccc tcacccccaa 33240 

aatgtaacct ccagcaggga ggatgactcg gtcagtcctg attgtgctgt agtccaggac 33300 

ctagaacaga gctccatgga cattcatggg ctctgtacac acaaacacac acattaacat 33360 

acaccccgac acacagcctc atccacacac acacagcctc acacctgctc tttgcagcca 33420 

cctgcacagt ttctcacaca ctcacttgat ctagtgatct gcgtccacag gcccctcccc 33480 

cagcccactc atactgccct caccccactc actctgccct caccccactc gggggaactc 33540 

tgctgccagg ccaggcctgt gacactcacc gtgagtgaag tgaacgttag gcagatgcca 33600 

ccaaagccat tcagggacag cgccaggaat atcaacggag acagagctgg aaaggggaaa 33660 

gcagcagatg agggcatttg gggagctgtg ggaagccaag ggcgggagct ggggtaaaca 33720 

tccgccttca tcccacctat tcttttcttg tggggccaca agaggacaga caactcacct 33780 

tccacgtccc gggaggccag ggccatgagg gtgcaggacg cagtgaagca ggcactgtgg 33840 

agacacaggg aagggcgagg ggttggcctg tgagcacccc ccctcccctc cccctgcagc 33900 

acggtccctg tcctcccgtt ccccatagcc cagccacctc acctgccaac cagccgcacg 33960 

ggtcgggggc caaagcggtc catgaggatc cccagtggca gggtggtggc gctgagcacg 34020 

aaggaaccaa tggtgaagcc caggttgagc atctcgtcct gctggtcaca gcctggccac 34080 

ctgcgctgct catcctgggt ggtgttggtg ctgctctcag ctgaggaggg ggaagggagg 34140 

gctcagcaca tgacaccagg aacagctggg cacaggagac agcagcccac agtcaggcgg 34200 

cctgctttca aatcccatgc caagtgcctt tgggggtacc ctagagtcac atctcctctg 34260 

atggggctgc tccagaaatg gcagccatta gtacctgacc ctgggagagt cttgtgcaca 34320 

cacagcctga ggcttcaact agctcaaatg aaatactgga cataaaagta tttactaagt 34380 

tgtaatatgc actcagtgtc caagcttagg gggttgtgga cccccaacaa gaagtgcccc 34440 

catatctaga ggcaaaggca aaggcagtga gtggtactct aatggctata acaagaattc 34500 

attaaaatgg cccggcgtgg tagttcatgc ctgtaatccc accactgtgg gaggctgaga 34560 

caggcagatc gcttaagcct acaagtttga gaccagcctg ggcaacatgg taaaacccca 34620 

tctctaaata aaaaaaagaa atttagaaag aacactaaaa cttagaggaa gctttcccga 34680 

taaatgatag tctgataaaa taatagctaa tacttattga gcacttaact atgctccagg 34740 

cactgttata agtcagttaa taaagtatcc cgttccctag gtgatgaagc tgaggcacag 34800 

aatgagaacc aggcactgcc ctccagtccc ctctagaagt ccacttggag gacttgtcct 34860 

taacggtaaa ctgccaactt ggagttgtga caagttaagg agaaaagcta gtgataggag 34920 

acaaagggct gcttcgcttt actcaatgct cannnnnnnn nnnnnnnnnn nnnnnnnnnn 34980 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 35040 

nnnnnnnnnn nntccctccc tccctccctc cctcccttcc ttccttcctt ccttccttcc 35100 

ttccttcctt ccctccatct tctccacacc tggtatcatc atacagaagc agagaggact 35160 

gcacttggtg aaagtttcaa ttctcctgtg tggagaggtg agcactgagg aaggggtggg 35220 

ggctgtcaaa ggagacttac ccaatctttc cagcccacca atcccttgcc cagtgtttct 35280 

ataaaataag ggccttttgc atctgattta agtaggaagc tgattcctga gcccctcaga 35340 

tctgctgaat cagatagcta agggggccct ggaatctgca ttttagcaag cggaggtggg 35400 

ttatgaagca ctccaaagtt tgagacacgc ctcaaaggtg gagtggttct gtggggggca 35460 
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gaaaggaaaa 
tggaaagggt 
caagccagac 
cgaagcagac 
cgtggagacc 
ctcatccccg 
gtccatcact 
ttctacccca 
ccttfcgctat 
ctcgcccctg 
gaccccgcca 
ggattgcgcg 
cgcatatatg 
aggcttcgaa 
cgcaccctct 
ccagctcgct 
cagacgccct 
ggccagaggc 
ggctgcagat 
cgggcagagc 
accctcggag 
cgcacgtggc 
actattcgca 
agccagccga 
aaaaagtcag 
ctttgccaag 
tctctcttat 
aatgctgtct 
actcttggcc 
gctatccttt 
cccttgggtg 
caacaccaac 
atcctaaacc 
tgtcccagag 
ctcttgtcct 
ttatttattt 
aaggaagctc 
actttcggca 
tctaaagcat 
aaatcctggg 
gttgggaaca 
ctagttaatg 
caccaaccgc 
taagacttgc 
gacccccaac 
agggctgtgt 
gcacatggga 
accgctagaa 
aacccctcta 
gaaggttttt 
taaatctacg 
cttttggttg 
gcggtcacct 
tgctggagcc 
aatagcagga 
agtcatctgc 
agccagaata 
ttaatgttca 
agggcctggt 
cactgtagga 
tctgaccttt 
agacacagtt 
gacacgcggg 
gcaaataatg 



tgcaaagggg 
gatgccaggt 
cggggaggag 
acttgcaggg 
ctggagagcc 
ggcaccagcg 
cccccggcca 
cagtccccag 
ttgcgatccc 
ggagggcggc 
ccccgtcccg 
gcagggtctg 
cccccctttc 
attcctcgtt 
cccctcccca 
tccgcccctg 
tctcctagga 
aagcttcgag 
aacgcttgca 
aagccccggg 
ccagccgcag 
gcttaccccc 
aattgaggat 
acccgctggt 
tgttggaagt 
tctccagacc 
taacttctca 
tacagagcca 
ccaggagcag 
ccacagactg 
tgtgccttaa 
ctcttaggct 
aatgttagtt 
gcacaagcag 
cactgatggt 
actttaaagc 
aaaatttgca 
ggccctaggg 
tctgtcctaa 
aagtaggggg 
gggaggtgag 
cccagctgag 
caaagacagg 
aagcagcagc 
ccctaaccca 
ctctctcaca 
cctcaggggt 
ggagataaga 
ggatagggac 
tttcctcctt 
gccatggctc 
ctcaggcctt 
cagagccgct 
gggcacggta 
ggcccagatt 
ccttagcctc 
aatacagcta 
gcaattctgc 
gggtgcccac 
gtctgggctg 
ggagatgtta 
caaggaactg 
ctctgactcg 
gcaattctta 



gaaggggtca 
gtggggagta 
ggggccacag 
tgcaccccgc 
cccaaccttg 
gccttcccgg 
gagcccacca 
tcctagctca 
agaactcgtt 
cccattaacc 
cggcgggggt 
ccacagggca 
tgggaaaaga 
ccctatcctc 
gccatctgtt 
cccagccccc 
ggccggaatt 
accccccacc 
aggacgggag 
aagaggcagg 
ccacgcagcg 
acccccgggt 
cccggacaca 
gggtgctagc 
cgggagtctg 
cctgaggaca 
agagaaacaa 
acctttggag 
ctgagaaccg 
ccgaggttcc 
cctctctggg 
atagtttgga 
ttcccttcat 
gtgcagggct 
gtcttctctt 
ctcactttaa 
taaagtttta 
atgctatggt 
tcctctgtat 
caggagctgg 
ccagacagcc 
cctgggtgaa 
cgcacaccag 
accagagagg 
ggacacagct 
ttcacacata 
agcctgtttg 
ggcaccctgg 
tgtcttcagt 
tttgcagtct 
tatgtgcatg 
gattgcctgt 
aagcaccttc 
atagaagagg 
cacctttagg 
ccacagggag 
gtacttatta 
aaagtggctg 
accatatggc 
gtcaggctgc 
accaatggga 
ggatgcacag 
gttcacatcc 
ctgagtgcct 



cacttgggga 
acagatagag 
ccaaggtgag 
cctctcttcg 
tttctggggg 
gaggctcaac 
acgctcctcg 
tctgcataaa 
ccccaccccg 
ctcgcgaccc 
ctgggggtga 
gaggccaggg 
cggggagggg 
cggcccccgc 
ccactccgca 
tccccaagcc 
tctgcctcca 
aaccaccacc 
tcggggaggg 
ggttttccct 
ccgcctgccg 
ccgctcctgg 
gagtgcagag 
caattctgat 
ggctcagagc 
ggttttccta 
agacaaaggg- 

gtgggggaga 

gaaagaagct 
aaattgagct 
cttgtttcct 
taaaatgaga 
ttggggactt 
ggataaataa 
gatatagata 
tgttaaaggt 
agataaaata 
gggaagtttg 
ggagaaaagc 
actccctcca 
agaggcgaac 
gaaggatggg 
ccagtctctc 
gaacctgccc 
ggcacctcag 
cacagacaca 
ccgatccccc 
tctcctccaa 
caatggagcg 
ttacaaaaat 
ttacaggtag 
catccaggtc 
agtgggccca 
taggaaggca 
gcaaggagag 
gagaaggcgg 
tgtgtagtca 
agatgagact 
actcactagg 
tcccgaaatg 
tcccgttcag 
cctggtggac 
cactctgcat 
ccttctcagg 



aggtttcaga 
gaggcaaagt 
acaggtcagc 
tggcaatctg 
gtgggtcaga 
acgcagatac 
aggtccgacc 
gctccaatta 
agcccgtttc 
gggccgctcc 
ggggcgcgcc 
ctctccggga 
ggcttctcct 
acccctcctc 
gcgccgcgac 
ccggggagtg 
tctcccaccg 
accgttgcga 
tgtagggcga 
cccgggtcgc 
ggcacaccaa 
ctcgcgctca 
accccggcaa 
tttgtacttt 
agcagggatc 
tctgaaaacg 
agggaaaatg 
tggccaaggc 
tgggacctcc 
ccaccaccta 
acagcgacaa 
tagctgtgta 
gctctaacct 
ggtatgtctt 
attttaaagc 
aaatgtaaat 
ggagactcca 
agtcatacct 
cagcttcctg 
agcactaagg 
gggctggcat 
ggtgtgggga 
acttcccttt 
tcctggccct 
gcccctttcc 
tgcatgtgtg 
caagaggtac 
cccaaggagg 
ttgacttagg 
agaacttctc 
aaaagccata 
ccttggtctg 
tcccattggc 
acaggaatcc 
agaaacagag 
ccatttttct 
ttgttccacc 
tctcaggtat 
taggtatgag 
gggccttctg 
ggtggcgaga 
agaaggcttg 
tactcactgt 
gctgttgtgg 



caataccgag 
gagtggagac 
agccagaaac 
agaccgagga 
gaggaagcct 
ctggataggc 
ttgtccctcc 
acatgttttt 
ccgccgcttc 
tggcggtcct 
ctggggcaga 
aaaaggcagg 
gggagactcc 
ctccccgcca 
aaacacggct 
ggggagtgag 
gggtccggct 
gggccggtga 
gtttaaagga 
cgccccccgc 
ggacctggcg 
gcctccccag 
gcctactgaa 
acaaaaacaa 
tgcgatgtga 
gaggggacag 
gcttagctgg 
ctctgaggtc 
tttctgcaga 
acactgtgtg 
gaaagaatga 
gaacagacag 
ccagggctta 
tctgcaggat 
ttcacgttat 
atagtataac 
aaaaagtgtt 
tagcattctt 
gatgtacccc 
gcagggcatg 
gccaagcgtc 
agacaccccc 
ttatttcctc 
ggaaggggcc 
ttctgaaagg 
cacactcatg 
caggaggcag 
aagaaagctc 
gggcgttttt 
ttggtattta 
tggggcactc 
agaagtctat 
ggcgtactcc 
caggagtgag 
tcaagtaggt 
ccaggtcctg 
agtatctcac 
aacaagtggc 
gaaggcacag 
ggctcacccc 
ggaggctctc 
gaaggcccag 
gtgactttgg 
cgaagatgta 



35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
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agttaaaaaa aagtatgcat catgcttagc acatagtgag tgcttggtaa atagaagcag 39360 

ttatttcatc acaattcttt gggaggaggg tttacgtgtg ggtggcccca cagggcagat 39420 

gaaagatcag cgtcagggag gcagatgagt tcaatgtaag gaaaagactt actaacagca 39480 

gcagggctgc ctcgtgcagg agtgggtgcc ctaccactga gggtatctaa gctaagaggg 39540 

aagggtcccc tttcaggggt gctggagaca ggatcccaca ctaggtagaa ctggattgga 39600 

ccaatggtgc ctgaacacag gcccaagagt caggactggc cacttcacaa agcacctgga 39660 

gtttactaaa aacagactcc taggaggtca ggcactgtgg ctcacgcctg taaccccagc 39720 

actctgggag gccaaggtga gaagatcatt tgaggccagg agtttaagac tagcctgtgc 39780 

aacatggcaa gaccctgttt atctgtacaa aatttttttt taaaaaatta gccaggtatg 39840 

gtagccatca cctgtggttg cagctactca gaaggctggg gccggaggat cgcttgagcc 39900 

caggaatcag aggctgcagt gagctgtgat tttaccaccg cactccagac tgggcaacag 39960 

aacaagacac cttctctaca aaaaaaaaaa aacaataggg ccgggcgcgg tggctaaggc 40020 

atgtaatccc agcactttgg gaggctgagg agggcagatc acgaggtcgg gagatcgagg 40080 

ccatcctggc tagcacggtg aaaccccgtc tctactaaaa atccaaaaaa aaaaaaaaaa 40140 

ttagctgggc gtggtggtgg gcgcctgtgg tcccagctac ttgagaggct gaggcaggag . 40200 

aatggcatga acccgggagg cggagcttgc agtgagccga gatcgcacca ctgcactcca 40260 

gcctgggcaa cagaatgaga ctccgtctca aaaaataaaa ataaaaataa ataaataaat 40320 

aaaataacaa taaattaaaa acaaaaacag actcctacgg tcaggctgag atatcctgat 40380 

tcaggggact ggggaatctg tatttttaac actccgtgag gggttctaaa aggcagacaa 40440 

cttggaaacc tgcagattag agacctctga ggtgcctctg gctgagatga gtgagggatg 40500 

gcaccacata caaggcccta cccctgcccc caggagagtg gctcctgctc cccccacacc 40560 

aaccctcgct ctcacccaga agggctctcc tttcaggggt cccaccatcc ccatgaaaag 40620 

tggctgctga agcaaggcga acacagcact ggtgagggac tgcaggcctg tcagcgtccc 40680 

aaaaggggtt ggatgggaac ctgtccccaa aacgggagat caaagggtgg tgggggcctt 40740 

tcagcccagg caagaacttt ttcttttcct tcccaacatg ggnnnnnnnn nnnnnnnnnn 40800 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 40860 

nnnnnnnnnn nnnnnnnnnn nncactccag cttgggtgac agagtgaaac cctgtctcaa 40920 

aagaaaaaaa aatcttaaag aataaggata taaagaaaga aaatattttt gtgtagctgt 40980 

tcaatgtttg tatttcaagc caagtgttat tacaaaacag tcaaaagttt ttaaaaattt 41040 

aaaagtttat aaagtaaaaa agctaagtaa gctagggtta atttttttat cgaacaaaga 41100 

aaaatatctt tgtataaact tagtgtagtc taagtgtaca ttgtttttat tttatttatt 41160 

ttttattttt ttgaaatgga gtttcactct tgttgcccag gctggagtgc aatggcatga 41220 

tcttggctca cggcaagctc tgtctcctgg gttcaagcga ttctcctgcc tcagcctccc 41280 

aagtagctgg gattataggc acccgccacc atgcatggct agtttctttg catttttttt 41340 

ttgaaatgga attttgctct ttgacccagg ctggagtgca atggtgcaat ctgggctaaa 41400 

tgcaacctcc acctcccagg ttcaagagat tctcctgcct cagcctcctg agtagctggg 41460 

attacaggca tgcaccacca cactcggcta atttttgtat ttttagtaga gacagggttc 41520 

tcaactaaag agaaccatgt tggccaggct ggtctagaat tcctgacctc aggtgatcca 41580 

cccacctcgg cctcccaaag tgctgggatt gcaggcatga gccaccatgc ccagccagta 41640 

tacagtgttt ataaagcctc cagtagtgta cagcaatgtc ctagaccttc acattcactt 41700 

actactcact cactcactca cccagagcaa ctgccagtcc tgcaagctgc atgcatgata 41760 

agtgccctat ataggtgaac cattttttaa tattttatac tatattttta ctgcaccttt 41820 

tctatgatta gctacacaaa tgcttaccat tgtgttacaa ctgcctacag taatcagtac 41880 

agtactatgt atgggtttgt agcctaggct ataccatgtt gcctacgtgt gtagtcgtct 41940 

atactgtcta gtttgtacac tctatcatgt ttgcataaag ataaaatcac ctaatgacac 42000 

atttctctga gtgtattcct gttgttaagc aacacatgta taaacattta caagaaatag 42060 

ctcaaatttt tttttctttt gatacagggt cttgctttgt cacccaggct ggagtgcagt 42120 

ggcgcaatct cggcgcactg cgacatctac ctccccggtt caatcgattc tccggcctta 42180 

gcctcctgag tagttaggac tacaggcacg caccaccacg cctggctaat ttttttgtat 42240 

ttttattaag agatggggtt ttgccatgtt ggctaggctg gtctcgaact cctgacctca 42300 

ggtgatctgc ccgccttggc ctcccaacat gctgggatta caggcatgag ccaccatgcc 42360 

cagccattac gtttttttgg ttgtttaatt tttttttttt taagagacag attctcactc 42420 

tgtcatcaag gctggagtgc aatggcacaa ccatagctca ctgcagcctc caactcctgg 42480 

gctcaaggga ccctcctgcc tcagccttcc cagtaactga gactacaggt gtgagccacc 42540 

atgctcagct aattattttt tatcttttat tttttgtaga gggggggtct ttctatgttg 42600 

ctcaggtttg tctcaaactc ctgggctcaa tcaattctcc tgctttggcc tcccaaaggg 42660 

ctgggattac aggtgtgagc ctgaaaacct tctagtgtgg aagtggaaga taggcccagg 42720 

ccacttatgt tttcaagtta agcaaggttt aggtcactta tgaagcctga ctagttttgt 42780 

ttgcttaagg gatctgcagg cctgacctcg gttttcattt gttttaacag tgtctatgtg 42840 

tatgtgtgtg tttatgtacg tgcatgatgg ggggaaagct cagaaatcaa gtaagccaaa 42900 

cacaaacatg taattataag cagggataaa ttctatgatg aagaagtatg ggccacggga 42960 

gagtacttgt gccagtctgg tgatcaggaa caatgtcctt tgggaagtga catttgagcc 43020 

atgccctgaa gtacggtagg agttggttag gggtgaggca gtaagaccca gagctggggc 43080 

ttcctgcaca agctcagctg ggcactgagg acccagtgga ctctgctaca gggcagtgag 43140 
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gagcagaaag gctgaggaag gctgggtgtg gtggctcaca cttgtaatcc cagggctttg 43200 

agaggctgat gggggaaaat cggtagagct caggagtttg agaccagcct gagcaacata 43260 

gcaagactcc atccctgtaa aaagctttta aaaattagct gggtgtggtg gtatgcatct 43320 

gcagtctcag ctactcaaga ggctggggta aggattgctt gagcctagga ggtggacgct 43380 

gcagtgcgcc acgattgtgc cactgtactc caacctagga gacaaagcga gatcctgtct 43440 

caaaactgaa tgaataggct gtgtgcggtg gctcactcct gtaatcccag cacttttgga 43500 

ggctgaggtg ggtggatcac ctgtgattgg gagtttgaga ccagcctggc caatatggtg 43560 

aaacccgata caaaaattaa ctgggcatgg tggctcacat ctgtaattcc agctactcgg 43620 

gaggctgagg catgagaatg tcttgaaccc ggggggcaga gggtgcagtg agctgagatc 43680 

gcaccactgc actccagcct gggagacagc gagactccat ctcaaaaaaa aaataataat 43740 

aataacaatt aaaaaaaaat taaaaggcca gggagcactg gcagcctgtc caaggtttca 43800 

ggtcacttta gtaaagggag aacaatggct cctcccagga cctctgggat ctcagcattg 43860 

atacgacagt catggaaatg ctagggccca ggcagaccat ctcagggaaa acaagtggct 43920 

ctgccctgcc ttggccactt cctggccctc tgcatgcccc agggtctcag caccaagctg 43980 

ttctcagtga gtagctctca tttagtgcca gggctctcgg gcttacatcc tacgatgacg 44040 

atggaatgca taaaagatgg ggctgtgata gcccagagct aggggtttga atctcatgag 44100 

atgttcatgg agccctggga gggagctcag tgcaagttca tttctctttt ttggttgaga 44160 

tggggctcag aggaggaagg acttgttcaa agacacacag ggagtgtttc agtgtgggac 44220 

ggaggtttat ggagaaaggg tgaccatcca aggcttggac aaagatcatg acttcgacca 44280 

gcaagcctca actctgtaga cttggtgggg gccaggccct cccaaacaca cctgacaggt 44340 

gtctgtggtc ttggggacat tgtcgctccc cttcctgctg atgctctgct gtccctctcc 44400 

catgaagcgt atctcttcgc cgtcccccat ccttgctgag agaggatggg ttctcttctg 44460 

accaatactg aagatcttta gtaaagttct cttttttttc attttctgaa agtccctctc 44520 

ttgagaaatc aggacaagtg agtcagggcc aggacaaaaa acagtgtggg acgagtgtgg 44580 

tggctcacgc ctgtaatccc agcactttgg gaggccaagg tggcggatca cttgaggtca 44640 

tgagtttgag actagcctgg ccaacatggt gaaacctcgt ctctacaaaa tacaaaaatt 44700 

agccaggcgt ggtggtgcat gcctgtaatc ccagctattc- gggaggctga ggcaggagaa 44760 

tcacatgaac ccaggaggcg gaggttgcag cgagctgaaa ttgggccact gcactctggc 44820 

ctcttggcaa cagagccaga ctacctctca aaacaaaaac aaaaacaaac gacaaacagt 44880 

gtagactttg tgtttttctc aaaagcactg tcaagccagt gcccgcagca gtgggcctag 44940 

acacctccag tcttgcctca gggtcagttt ccagcctccc tggacacttc ccccaggtat 45000 

gtgtactttt tgattgtcct aaatccagag tctgtggcct gacctggttt gtcacagctc 45060 

tcagtccctc cccatcccga atcccaggga gccgcaggtg tgtgcagaag aggcacacca 45120 

cactcaatac atcttgcatc ctcgctggac ccaatccatt ggcttggtga tgtacagact 45180 

gagcctcatt atagccgttc gttcctgttg acctttccag atcaatctgc cagcttggct 45240 

tctccgagtt tcgcttgtca gcatttctcc aatcccatca tgtactttgg acctctttgt 45300 

tgggtggctt gctttatctg aaattttcag atttgacttc aggtctctcc tttgtccctt 45360 

aatatggctt aatggtggac cctgtcaggg gtagagaaaa tattgaggag ccctgacttt 45420 

gaggtgcaca agttagaggg ttagacaagt ccagccacaa ccagcccaag ctgcagtgta 45480 

gggaggcctg tccagctgct ccacggttga gggtggagca tacaggaagg cttccttctt 45540 

gctgcagccc aggtgttctg gctgccctag ctgcctggct ttggtagaag aaagaaaggc 45600 

tctgtctctg acttgtcaac taatggcact atgagattgc acataattaa cctgggtctg 45660 

ctcttccaaa agccttgggc ctctgactgc aacatggagt ctgggtatca ctccccatcc 45720 

ctgcgccact cacctgctct ggcgctaggc gtgtgcctaa tcacttaatt tctctgtgct 45780 

gcctcttagg tatcacttcc cctgatccca aatacttacc aggtgtggga tgacacctga 45840 

ctagttactc cttggaggta tctgcttctc accggggact ccgaaaccaa acgaaaagca 45900 

aggccaagcc cagcctaaag gacgcttcct acatgacttc aggcttgcgg gggctggagc 45960 

gtgggggtgg caatggagtt ggggggggct cagggagggg atgtggaagt gctttgcttt 46020 

gcaaactcta gagaaccgtg taaataggag tgattattct gtcccttccc tttctttcca 46080 

acaggaatca gcatcccaca gcccatgttc agctatgaag aatggaaact gaggctccgg 46140 

gaggggtata gggaggagcc agcagggtct tgagttcata ttagtgccct ttcctccata 46200 

ggcacatctg tgttttcttt tattttattt tgaatttaat tttttttttt tttggcagag 46260 

tcttgctctg tcgcccaggc tggagtgcag tggcgcggtc tcagttcact gcaatctccg 46320 

cctcctgggt tcaagtgatt ctcctgcctc agcctcccga gtagctggga ttacaggtgt 46380 

acaccaccac acccagctga tttttgcaat tttagtagag acagggtttc acagtgttgg 46440 

ccaggcttgt cttgaaatcc tgacctcaag tgatctgcta gcctcggcct cccaaagtgc 46500 

tggtattata ggtgtgagcc actgcgctcg gccacatctg tgttttaaat gagaggaaag 46560 

gggataatgt gcattttgtg gaagcttggg ccgtttgtgt ctaggactct tatgatcttc 46620 

ataagttttc ccccagggag gacactgttc cacttaggga gtcaggaccc ccagtcctta 46680 

caagattcag cctctcaaaa tggagacagc agttccaggc ctgggctggg ttctgttcac 46740 

actaggagag ggcaagtgag tggtgtttgg gatgtgggga agtattatga aaacagagat 46800 

gctccaattc ctagtgatag gaaaccatta agctacttgg catcttaaaa ccaagagcgg 46860 

ttcaagttct gagattgtta acacacctta caacaccgcc gccgttatta ggaagaagct 46920 

ctgtttgatg acgtcccaca ctgtgggtac ctttatgaac aggaatttgc tttttcaaat 46980 
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cccagagaag taagattaaa gttggctgtt ctccatcctt gaaaaatttg gttttagggt 47040 

gaattcaaga atgactgacc atacagaatg gggagcaaac ttgggaagaa agaaggcaca 47100 

gttcagagct ctcccaatag tcacccctga actgcacccg gaccatcagt tatctctgtg 47160 

ggtagagctc aggaatctaa aatccatttt aaaattaaag tatatcgggg ctgggcgcgg 47220 

tggctcatgc ctgtaatccc agcactttgg gaggccgagg tgggaggatc acgaggtcag 47280 

gagtttgaga ccagcctggc cacatggtga aaccccgtct ctactaacaa tacaaaaatt 47340 

agccaggcat ggtggcagac acctgtagtc ccagctattc ggaaggctga gtcagaagaa 47400 

ttgcttgaac ctgggaggca gaggttgcag taagccaaga ttgtgccact gcactccagc 47460 

ctgggcaaca gagggagact ctgtctcaaa aaaaaaaaaa aaaaaattaa agtatgtcat 47520 

acatactgtt acaggcacag accttaagtg tacagcccaa tgaaatttta cacatctata 47580 

cagctatata actaccacct atatcaagac acattccagg aactcagact ccatcatacc 47640 

cctcctcagc agaggtaaca gacccacacc tctcctgctc cggtggtaat taaccactat 47700 

tctaactttt ctatcaatta gttttgccca ttcttgagct tcacacagat atacattgtc 47760 

aggcatgatg actcatgcct gtaatctcag cactttggga ggccgagacg ggagtatcac 47820 

ttgagcccag gagttggaga ctactctgga caacatagtg agacccccga ctctacaaaa 47880 

aaaataaatt agctggtcat ggtggtgcgt gcctgtagtc ttagctattt gagacgctga 47940 

gagaggagaa tctcttgagc ctgggaggtt gaggctgaag tgagccgtga ttgcaccact 48000 

gcactgcagc ctaggtgaca gagtgagatt ctgcctcaaa aaagaaaaaa tatggccggg 48060 

cgcggtggct caagcctgta atcccagcac tttgggaggc caaggcgggc ggatcacgag 48120 

gtcaggagat ggagaccatc ctggctaaca cggtgaaacc ctgtctctac taaaaataca 48180 

aaaaaagaaa gaaaaaaaaa ttagccaggc atggtggcgg gctcttgtag tcccagttac 48240 

ttgggaggct gaggcaagag aatggtgtga acccgggagg cagagcttgc agtgagccga 48300 

gatcgcacca ttgcactcca gcctgggcga cagagtaaga ctctgtctca aaaaaaaaaa 48360 

ggaaaaagaa aaaatatata tacattgtgt actttttggc atctggttta ttttgctcaa 48420 

tatcacatct gcgaaattaa tctacactgt gtgtatgaaa ggttggttct ttttgttgtg 48480 

atgcagtatt ccgtcgtgtg actacgggac aatttgctta tccgtattcc tatcggtggg 48540 

catttgggct gttaccaggt tctggctgtt atgaataaag- ttgctatgga tattcttgta 48600 

cactacttct ggtgagcgta tgcactcatt tcgcttatgt aaatatcttg ggtggaatta 48660 

cctgatcata aggtaggtgt gttggctttg taatgtgctg' acttggttat gctgaattcc 48720 

cttttttgtg tatttctggt tagagcggaa catgagggtg tctcttcagg gaatctggag 48780 

ggtggaaggg aagcaggagt cggtttctgg ctcacacatg ttgtgactga actgctggta 48840 

cacctggttg gcatggagct ggcttctcct ttggcgttgc ctactgttgg ggcaggtgtg 48900 

tatgtggtta gctccatgca atgaacccgg gcttctgcaa aatacattaa caacgacaga 4 8960 

gacaacaaaa gctgatgtgg atttaaaggc ttcagttcan nnnnnnnnnn nnnnnnnnnn 49020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 49080 

nnnnnnnnnn nnnnnnnnnc cagcagtggt tctcaactga gcatagtttt gcctcagagg 49140 

ggacatttgg taatgtctgc agacattttt tgattgtcac agcccagccg agaaggtact 49200 

actagtatct ttttggtaga ggctagagag gctgctaaac atctaacaat gcacaggaca 49260 

ggcctctgta acaaaaaagt atccagtcaa aaatgtccac agtgttgaga ggtttaggta 49320 

agtaggcgct aaaacataag gagactgtgc ctgagagcaa gaaggagtaa ttggaaagtg 49380 

ctggtgtgat tagctctggg ttttagaaag ctcattttgg ctgcttgtag acagtgcatc 49440 

agaggtggag gagggtggta agactggagg cagggaaagt aatttgggag ccactgaaat 49500 

gatccaggtg aaaaacggtc agcaggtgac taggaaagtg gcagaggcaa tggggatggg 49560 

tggctggatg agatggtgaa gaaagcacta taactaacta atgtgtggat gatgggcagg 49620 

aggggtgaag gatgaccaga gtcctgcctt gcaggtctag ttggaaggtg atggtttctc 49680 

ctgagaaagt gaccacaaaa agtgaagcag gtttgtgcgt gtgtgtgtgt gtgtgtgtgt 49740 

gtgtgtgttg agttcagtct gagatgtgtt ggactcacaa tgtccatggg acatccaagt 49800 

ggagaagcat cttgggtgac catatgtgtg agtctgcagc tcagaaacag gcctggggct 49860 

ggagatgaag acttgggaat gatctgcgta tatatttggt agcttgagcc acaagagtag 49920 

atgacataac ccgtggtggg tgtgcagaat taggagagac gtgcaccaag aagccaggtg 49980 

atccccaata tttaaccatc tggaagaata agaggagcct gccaacagaa attgggaggg 50040 

aatggccaca aaggctactg agaagggaag cagttcttaa gaagggggaa gtgaagaggt 5*0100 

atcactactg cagaggtcaa gtaggataag aactgaagaa tgtctgttgg gtttggcaat 50160 

ggggtagtca gtgggcacct gggcaaaagc agttttggtg gagcaatagg gataacagaa 50220 

acaagactgc tatggtaaga ggaggaagag ggtgttgagg aagtggccag cgagtctaca 50280 

ccacttgctg gaggagcttg gctttggtgc aaagcagaga agccagctca ctcattgact 50340 

taacctccaa gaaacacaaa atcatccata tcctggctca aattccagca ctaccaggag 50400 

atggttggcc cctagaaatg ccatcccact tctcctctgc ttatcctatc ctatctgtca 50460 

gtctgttgag cccaggctaa gcgctacctc ctcaagcaag ccttctctgc ctgccgtcac 50520 

actttaagtg atcctgacaa cactgaaaat gtgtgtctct tccattcatg ttagttctac 50580 

acttctgagt atctcctcaa tatattgcct tgttttacta atatgctcgt tctgtttgcc .50640 

ttatttatca gctaccttaa acctccctgc aactagagat tctctttaag tatttgttga 50700 

ataaatgaat gaatcaatcg atgatccaga gcctggtaga ggcttgtgtc catggtggat 50760 

gaggctcaga aaatacctgt agaatcgaaa taaatgcatg tgtgctctga tctaaactca 50820 
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gctaaacttt ctccaggggg taaagttcaa gttgattagt caattgatta attaattcat 50880 

tatgtaatgg aaaaactcct tctatgacct gggcagagtt ataggcagtg aacaagacag 50940 

acaaggtcct tgttgtcatg aagtttgctt tctgaaggag agagataata aacaagaaac 51000 

cagtaagaaa gcaagattat atcattttgg taaatgttct tgtggaaata aatgtgatga 51060 

tgtgtaacaa aagtaccaaa taggagagtg gggtgggtgg gcttctttta gaaagagttc 51120 

tcggagaagg cttatctgag gaggtggcct tttaaccagt acaaatgctt tagcttggcc 51180 

agtggagctg ggaccaggat gacaagggtc acttgtcatg ccagtgagtt tgagcttgta 51240 

gacaagagcc tgatcatgaa agactttgca gatggtggta atgggtttgg gttaattgct 51300 

actatgtggg aagactttga atgggaagca tggggacaat ggcctgtgat acatgttatc 51360 

aaatatggtc gcaggggcta gtgaggtggc agcagagata gggagaagta gacggactgg 51420 

ggaaggtaga agatggggca ggggaggcaa ttactgcaaa gacatattcc ttctaagctc 51480 

actgagtgtt catggtctct gggagcagag gttcctggag gggaaagagg ataatgtcac 51540 

ttcctgagga agcgggaaga acccatctga gacgtgggga ctgtgctggt tcgtttctaa 51600 

ggggccttcc agatctcaca tgccaatcgt cttggtctat gtcaattgtt ggggcatcca 51660 

aatggggaac tgttgtccag gccgatttca cagaacaacc gcccagtcca tatctcccga 51720 

gccattcacc cttgcagtgg cgttagctct ttcaccagct tttatctgcc ccgtggggat 51780 

gttggccaag cccagttaac aagcagttga tcagccccag agatcaggtc cctggagtct 51840 

gtcacttttc tgagggtggg gagagaatcc tggagcagaa catgtaacta gaagggccac 51900 

ctggcttcct atggtctgag ggagagaatg gtgggatctc tggcctgaat caaacctccc 51960 

tttctcagtg tccatcttac ctctctgctg taccttcgtt attttccagc agctcctcag 52020 

cccgttcctg tgggaccctt ctctgccaat ccctacaccc actgtaaatt tcaccgtggg 52080 

agggagatgg gccttgaggg ctgtattagt cttctattct gcataacaaa ttgcctcaaa 52140 

tttagcagct tcaaacaact catgtttatt agctcatcgt gagttcatca gcagtgtggg 52200 

cccagcatgg ctaggttttc tgctcagggt ctcacaaggc taaaatcaag atgttgtctg 52260 

ggctgtgtgc tcatctggag tttagggttc tcttccaggc tcacgtggtt gtggcagaat 52320 

tctgttccct ggagttgcag ggctgaggtc ctgttttctt gctgactgtc agatgagggc 52380 

tgctctcagg tcctcgaggc tgcccacatt gcttgccacg- tgcgtggtct tttccatcct 52440 

tgaagccagt gatggagaat ttcccttgga ttgaatcacc cacatggttg gactctctga 52500 

cttcaggaag agagccctgt ctcttttatg ggatcacctg" attagatcat acccatagag 52560 

ggcagttcct tttccttaaa gtcaactgtg gcatgtaaca tcacacaacc acaggagtaa 52620 

aatccatcat atttacagtc ccagggatta tgcacagtgc accaggggac aactgaattc 52680 

tgcctgtcaa aagggccaag caggacttta ttggtgaaga acagtggaat gtcattcttg 52740 

gttcttccag aaaaaaatca ctcagtaaag ttagaggttc tcttgccttt tgggaagtca 52800 

tcaaagaatc tcatggaggg tttggacctt caccctagaa acatcacacc atgttttcta 52860 

taattgcagg gttcatggtc ccttgaagcc tattcatagt ttccaggttg aaaagctctg 52920 

ctgcagggtg tggggaggga tgcaggtgga ggtgagggct gaatagtgtg agctgcatat 52980 

ctggagctgt ggtggttttt ttagtcttta agctgtcatg tgttgggggt tgggcatggg 53040 

aggggcatcc caagagctcc ttggtattga caccatctcc aaggtgatct ctgctctgcc 53100 

tggtgcacac atgtttttct cctgttgcaa cagcccactc ttgtagaaga gcagacccct 53160 

cagtaccagg tctgaccctg gacagcttgt accaggagct acagcacact cccccacaag 53220 

cctaaagttg ggatgagccc cccgagaatt agatcagaaa agattaaatg cagaggtgat 53280 

ctgtcaggtc ccctttggaa gtgctggtat ggagaggatt gactgagtct gtttaggaac 53340 

ctccaagctc tgtagtaact ttagggctag aaaggaggat gcctaagatt caggatcctg 53400 

cagtgatgag tcaacatttc ttggggaagg aggcagggct gaggattaaa cggagatgat 53460 

gggtatcgtt ctcttgctca aaggcactgg accccaaggc ctccagctct tcgctcccat 53520 

ttgaaattca agtcctgagc acaccacagt tgtgatgcag ggaaagaatg tgcttatcag 53580 

agagcctggg caagtgggcc ccttgtgagt accgttcaac ctcatttatg tcattggcac 53640 

caaaagtaga catcagtctc ttgaaagttt gattaatgct ggtcacactc aaagaccctg 53700 

ggtagcattc atttactaag caattactaa ataccagttt ctgtgctaaa tgctgcatca 53760 

gtcagggctc ttaatggcag gcagcagaaa ctctccttgg ctgatctaag tagaaaaatc 53820 

caggactgaa aggaaacgga gtagctcatg aaattgcagg aagggccgga aaaccagaca 53880 

tggagccaaa gtcaggctgc agaacaggtc tagggaggat cccactgctg ctgagaccta 53940 

gaccttgtgt ctggcaccca ggatgttgta gggctcagac cctggatcaa tgtatcctgc 54000 

agtgcctctg tgggtactgc aactccagga actcaatctt gtcaacgcca ccgccagaga 54060 

gaggccttct tggcctccat ctttttggtc actagctcca gattcaaaat cttgaataga 54120 

tgcttcttct ctttgataga gcccagtcat atgcgttagc tgcaaaggaa gctgaaaatc 54180 

tattaggaac ttttgtcttc aaaaatgaga ggcctgtcct ccaccaagat ccataggaaa 54240 

tggaatccaa gaaaccacag gaaggggtga ggtgactggg cagctcacag catgcatgct 54300 

acatgtgaat tatctcattc atttctcaca ctacccagtg aggtaggtat tgtcatccct 54360 

acttcataaa tgatgatatg aggtacagaa agtttaagga acttgcccag gacacgacac 54420 

gcagctatta agtgctagac ccagtcaatt tgagtctgac ttggactgtc tgactccaga 54480 

agccaccctc tcagacactg ctgtatactt ccagtgaatg ttgatgaaat tttcagggtt 54540 

gctaagctgt ggatttcaga tcctggattg tatgacctaa aagagagact tccctaggag 54600 

tgagggtccc tgaacagtca actggtttcc aagaatgggc tccctctcat caccttatga 54660 
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cagtaatcct ctgtccaaca gccaaagagg tcctgtgggg agggcttgca gatgggagtg 54720 

cgcagagccc agctcaaagc tcctgactag gctcttgttg agtattcctt tgattcctgc 54780 

ttctgtcttt ttaaatcaat ggagacaggg gagggttatc tccatcctcg gctcaagatg 54840 

aaatgcatcg ttcctcgttt ttctcattcc ttcccaatgt gtgtactgtt aactttagtt 54900 

atgaaggaaa ttacagtgtc ctgtgcatat accaaggctg tccaacctcc acacctttgc 54960 

tcaagctgtt ccttctactt gaaatgcctg tttccttccc ttctaattgc atctttccat 55020 

ccaggtagga atcagctcct tggttcatgg agccttttct gctctgtttt actatgcatg 55080 

gacttccttc tgaattagca gaggatgttt cctagcttgg tcttaaccct tctccttttg 55140 

tttgacctca atttactcat cttacaaatt aggttgtaag ctaattgaat acaggatcta 55200 

tgcttcactc tgattttatc tccacctgga tagcatcatt tttgacacac aagcaggcat 55260 

atgggagggg agagaagttt ggtgccagaa agaactggat ttgaattcta accctgttgt 55320 

ttacgtgagt acgttactta accattaatt acttcaatgt atatttatta agtacctact 55380 

atgtgccggg cactgtacta agcaccaagg atacaatggt gagtaaagag atgcagcctt 55440 

caccatcacg aaggaagaca gatgttaatc cattaaccaa gtaatctcac aagaaaagta 55500 

aaatgactaa ctgataagga caagcccctg gagctacaag agggtgtata cagggcatcg 55560 

atccaataag ggcagtgttg cggggagatc aggagccaca cagagcctgg gttgtctcac 55620 

ttggaaaatg gggtatcaac cacctacctc actaggtttt taaaatcagg ttaaatgagg 55680 

taatacttgc catgaacagt attttgttga ttgatgattg attgaaacgg agtctcactc 55740 

tctcgcccaa gctggagtgc agtggtgcaa tctcagctca ctgcaacctc tacttcctgg 55800 

gttcaagtga ttctcctgcc tcagactccc aagtagctgg gattacaggc agccacccct 55860 

atgcctgact aatttttgta tttttagtag agacaaggct ttgcaatgtt gaccaggctg 55920 

gtctcaacct cctgacctca aaagatccac ccacctcagc ctcccaaagt gctgggatca 55980 

caggcatgag ccactgcatc cagccacttg ccatgcatgg catttaaaaa tgttcagtaa 56040 

atgttaccat aatgaaggct ggtaggttgg ccaactgagt ggtctgattc agaaggaaag 56100 

aagttagaca tacgtgaaca tttcctgtac ttgaagatcc tcaggacagt gactcctaga 56160 

cccatcttcc atcacagtca gctgggaagc ttttaaaaaa atgcagacat ctgaccttca 56220 

cgctagacct attagccaag cagaagtttc tgggcagggc atctgcatat ttttaaaaat 56280 

ctttaataag gcagcctcaa aattacagat tcagcacgca tttaccataa ccactgaaga 56340 

aatgcaaagt tataaaaaga agataaacaa caatctgtct cctgctttct tccctctcct 56400 

cccctgcttc tggaggcaac aaggtcaact atttggtgtg attcctttta gcattccctc 56460 

catcaatggt cacataagga tgctcacaga taagcaccta tgcgggggtt ttttttttcc 56520 

ttgtaaaact attcacatac taaatacttt cctcagtatc ttgccttttt tcacttcatg 56580 

tcacagaaac atctcttcag gtttatagat acaggtccag ctcttctttt catagccata 56640 

taacattctg tagaatagag aggacacatt ttactcagtg tccgattgat ggatatcaat 56700 

attgttttca tttctacaaa tagtcaagga ataacataac tctgtaaaag ttttattact 56760 

tataggcgca tttatgccta aaggatagtc tcaaaagagt gaaactgatc aaatgtgcat 56820 

ttttttattt taataggtat ggacagattt gttctcaaaa tgtttgtggc agttcaaaac 56880 

accagtaaaa caggggagat atgtattttg gaaaagcacc caaggcgatt ctgaagtgta 56940 

gcccaggata agaaccattg cccagagctg ttccagatgg cccctgggtt cctgaagtgg 57000 

gtatcgggag agaaatcttc actgaatgaa tgagtgggct ccccagggaa gtgatgaaat 57060 

ggtccttatc agccttgcta tctccctctg acagaggcaa actctctctc cctgggggaa 57120 

gttcctccaa ggcctctata taagaagtct ttgtgagagg aagcaaagaa ggacctgggc 57180 

tttgggaaga tctaaagacc caggaaggtc tctgggtggg tgagtgcttt ctctgctgtg 57240 

gtggagctgg tgacagttta ttctcccagg aggtccctgg ctgtggctga cagtttctgg 57300 

agggctggca ggcgtctacc tgtggctttc aggttatgag gatgtcagca ggggcagcct 57360 

tcatcctctg ccttgcacat tccttctgcg ggatgtgaaa gtgctccttg gctggggaaa 57420 

ggagatggtg gagacatgga ggagggtgtg ggtggcttct tgaactctga ggaggggaca 57480 

taccttctaa gtcctatgtg ttcctaggaa agccaataat cattgcttct cccgcctttt 57540 

ttatgtcata gactctgagg gacccattaa gtacaaacaa ataagcgtaa tagtcccttc 57600 

tttacttccg ggcctgaagg aaagccagcc tcagccaccc ctcagggttt gctgcgttct 57660 

gtttagaaag aggtccttgc gtcctggatc ctggagcatc aggagctggg cttggcatga 57720 

gcttttctgg cccatcctga tttctattca ggccttcttt ttctccacct cactcccacg 57780 

gtcccctaat ggtgtgattg tgatgtgtgt gcatgtgtgt ctgtgtgtgt caatgacaaa 57840 

ctgtgttctc cgttgcagga taaagccaag atgaaactcc ccttacttct ggctcttcta 57900 

tttggggcag tttctgctct tcatctaagt aagtgttttt tgccttcagt ctttctttct 57.960 

ctgttttttc cctttctatg gtagatgggg tcagagttac acacccaccc ccttctttga 58020 

tcgtcttcta tttctgaatt tctgtgtgct taaagggatg gggactctat ggccaggagt 58080 

tgaaaggatt tctcaaggcg tctgttatgt ctgtggtctt ggttctactg tgacattccc 58140 

aattttgtcc tttctccatt atgcttactt tgagcttact gagtgccttc tctcctttaa 58200 

ctctcttagc atcgccatga agtaggtggt attgtatacc catttcacag aaatacagct 58260 

ggtggatgat ggaaccagta cccaagccca tgactgcccg actctaagtc catgctctta 58320 

accaccttga ccttgtcagg cagCttgggt tcccctcata gagactgggt tccaggttcc 58380 

ccttcccagg cagagttgag cactctgatg cccagggcaa ggtgtgagct gtctgtggtt 58440 

ctggggagga acaaggggag atgtgaagga aggacactta gctatcctcc ctgccagggt 58500 
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ctgagacttc cacctttgag acccctttgg gtgctaagac gctgcctgag gatgaggaga 58560 

caccagagca ggagatggag gagacccctt gcagggagct ggaggaagag gaggagtggg 58620 

gctctggaag tgaagatgcc tccaagaaag atggggctgt tgagtctatc tcagtgccag 58680 

atatggtgga caaaaacctt acgtgtcctg aggaagagga cacagtaaaa gtggtgggca 58740 

tccctgggtg ccagacctgc cgctacctcc tggtgagaag tcttcagacg tttagtcaag 58800 

cttgggtgag tggcctatgg ctgaggctga ggtgggagca tggaacgggt gtgggatatg 58860 

cccccagcat tgctatcact ggctcttttt cccattgagg gccctggggg tgtcagtaga 58920 

acctgagcct cagagaggtg ttggggtaag aggggagggc cacctacaaa cagaagttgc 58980 

attttggtct ccaaccttca aatggttgtg gcaggggagg gagggaatga attgtgggga 59040 

ctcaagaccc atgtgaattc atgtaggaag gatgctccat tctttgtctt ttatcctgcc 59100 

ctgtagttta cttgccggag gtgctacagg ggcaacctgg tttccatcca caacttcaat 59160 

attaattatc gaatccagtg ttctgtcagc gcgctcaacc agggtcaagt ctggattgga 59220 

ggcaggatca caggctcggt aagagaagtg tgaacactaa atggggtgca cctgctgatc 59280 

tcagccagca ctcagcttgc atcagatttg tctgtttttc tcctgtataa tctccagaag 59340 

aaccagggat agatggacac ccacagacaa cactgagggg gctgcctggg cattcaggga 59400 

agagctaagg atttagaatc aggaggtttg ggtccaagtt cctttccatc tctcactatc 59460 

tatgtaactt aagttagctg ggcatggtgg tgcatgtctg taatcctagc tacttgggag 59520 

gctgaggcag gagagtcact ggaacctggg agacagaggt tgcggtgagc cgagatggag 59580 

ccattgcact ccagcctggg caacaagagc gaaactccgc ctcaaaaata aataaataaa 59640 

taaataaaat aaaaaaaaaa ttaaaacaag accatgagtt tgtttcctca tctctaggat 59700 

gagttggcaa cccttgttct accttttgtt agggctggaa ggacaagcct gtcactggga 59760 

tgcatagaat ctgatggtga taattgccgt ggatcagcat ttcagatgac taggacagtt 59820 

cccatcatgg tccagcaggg aagggcccat tgcccggtgg gcagcagaaa gagctggcag 59880 

atacggggcc aggtctgctt ctctgccttc cctctgcccc atcccttctt cccctcttgc 59940 

tttctccagg gtcgctgcag acgctttcag tgggttgacg gcagccgctg gaactttgca 60000 

tactgggctg ctcaccagcc ctggtcccgc ggtggtcact gcgtggccct gtgtacccga 60060 

ggtgaggtgg ggctggggat gaacgatgga aaggtctggg* agatgggaag tgccccaagg 60120 

aggagatgct acaaagagcc tgaccctttg tgggagaggc ttcctgggtc ttttatatac 60180 

tctgactcca cagcagtgtg tgggtgggaa aagaggccct' cctgtgggtt gagttgggat 60240 

ggacaagagg ctgaaagtcc ctttctgttc tgccttcaca ggaggccact ggcgtcgagc 60300 

ccactgcctc agaagacttc ctttcatctg ttcctactga gctggtccca gccagcagtt 60360 

cagagctgcc ctctcctggg cagctgcctc ccctcctctg cttgccatcc ctccctccac 60420 

ctccctgcaa taaaatgggt tttactgaaa tggatttatt ttctcctctg atcgcggatc 60480 

cactctgctt agccctcatt gaaacttctt ccttatcatc tctccccaca ccacaacttt 60540 

catagaagtg tcagaagcta ctactccttg aggaggagga tggagggtgg agttgggtct 60600 

atggagcctt ttggagatgg aggaatgggc tcagctagtt ctcttcatag aacacctgat 60660 

tactgggcac ctgcatagtg ctgccaggac ctttcaaggt tgtaggtaga ctcccaatgg 60720 

cccagtttgc atctctgtaa ccaaaggcct tttctctctc tctctccaac cccagaactg 60780 

tggttggttt tatatgtaag gaagttaaca tgtccctggg aacagtccac aacattcagg 60840 

aatgaatgta taagtaccgc aatccccggc ccctcaagtg gaataaatct aacatgtatt 60900 

gggcaccatt tcccagtggc ctgctgtggt agttggcctt attccatgca tttttatggg 60960 

ctgccttccc ttcctcaact gcattctctg ctccttccta ctctctgcaa ctcccaaata 61020 

aacacttgta cgcaactccc tctctcagga tctccttctg gggaaacctg atataagaca 61080 

gcttgccatg cgtcagactc tgaatgaggc ctgggaatac aagacatagt cctctggcac 61140 

ttgggatata tggttatttg taacataggc acaaaaacat ctactagttg ttatcgctta 61200 

ttgagcaccc acaacatacc ccctgctgtg gcaggcacct tgcctagatg acctcatgtg 61260 

atcaataatt atgagcccta ttttacagaa ccaggctcag agaagttagg atctgtcaaa 61320 

agacttgccc aagactgaac ctctaaatgc aactcatatt gaaattcaac tctgctccaa 61380 

agcatgttac tttaaccctt gtgcttttac agctggctac tctcccctta tggtcacacg 61440 

gggatgaagc acggggggag gaaagccaga ctgtctcact cttgggttca tcttgggaca 61500 

caggacacca gcccagctgg aggtgaggga gctttaatca gaggggaggg aggaaggcat 61560 

tctcaacccc ttctgtacta gggaggtcag cagaagaaaa taattcaatg ttctaaagcc 61620 

atttttttct ccagcattcc tccaattcat agatcttcat atgggattag gggctcagag 61680 

aggggtgaaa caagaactct atttttttgg agtgtggtat agagaaggga tgctacttct 61740 

ctaaggtcac atagtaagtt gagaaagaga gagaaatcaa actcaggttc atttcaacta 61800 

ttgttccaca agaatctgtt gatttcaaag atggtggact atgggttcat ccctgtggtg 61860 

agtgctgtga ggatgcagct gaggtggaac tttcactcct tgccctcttg gactttatat 61920 

tctggtgtgg aaaggcattg cttcccttat ttcaatatta acaacaaagg gtaataatat 61980 

ttcccattta ttaagcattt actaggtgtc aggtactgtg ctaaatgtta ggtgaacttt 62040 

gtcttgttcc tcataaatct ctgccgctgt gggtgtgtac tttgacagaa gtttgacttc 62100 

cagtccacag agatcttctt tgggggagta atatcaagaa ggggcacgaa ggaagctgca 62160 

gggctcctag tcccatcctg tatctcgacc taggcatgtt tacattggtg cattcactgt 62220 

gaagtttccc tgagcagtcc actctatagt gtgctttata ggagcacatt gtacatccat 62280 

tgaaaaattt ttcttggccg ggcacggtgg ctcatgtctg taatcccagc actttgggag 62340 
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gccgagacag gcggatcacc tgaggtcggg agtttgagac ctgcctgacc aacatggaga 62400 

aaccccgtct ctactaaaaa tacaaaaaaa ttagccgggt gtggtggcac atgcctgtaa 62460 

tcccagctac tcaggaggtt gaggctggag aatcgcttga acctgggagg cgaaggttgc 62520 

agtgagccga gatcgtgcca ttgcactcca gcctgggcaa caagagcgaa actccgtctc 62580 

aaaagaaaga aagagatttt ttctttttct taaaaagtaa aaatcatgaa ataaggggac 62640 

tgggctaata ttccaaaata tgggtttgtg tgtgaatttt cctctccagt aagatactaa 62700 

ctaagctctg tgaaactgtt tatctatggt tctttatcat tgaatccttg gagttcctta 62760 

cactgtgcag agcacagagt aggggctcaa tcaacagtgc actcattgct ttttcataga 62820 

caagggccac cctcactcaa ctcatgtgcc aggcatagtt ctgagagctt tgcttaagct 62880 

gatnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62940 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnggtcact aatttggtat 63000 

agagttatgt tcattgattt cattttattt tgtttctgat ttttaaagat tgttttactt 63060 

gttttcttcc tattattatt ttattttatt tgtaaaacat ttacatatca gacatttaca 63120 

ttttcccaaa ggtaaaactg tgaaacaaga tatattcaaa gaagtttact ttccctctct 63180 

gtttcttgta ccccttttcc tcttctttag gtaaccattt ttattttttt aaatataaac 63240 

attgtgtagg tgtatataca tgtattagtc tgttttcatg ctgctgataa agacctatct 63300 

gagactggga agaaaaagag gtttaattgg acttacagtt ccacatggct ggcaaggcct 63360 

cagaatcatg gcaggaggtg aaaggcactt cttacacggt ggtggcaaga gaaaaatgag 63420 

gaagaatcaa aagtggaaac ccctgataaa cccatcagat ctcgtgagat ttattcacta 63480 

tcacaagaat agcgtgggaa agactggccc ccatgattca gttaccctcc cccactgggt 63540 

cccacccaca atacgtggga attctgggag atataattca acgtgagatt tgggtgggga 63600 

cacagccaaa ccatatcaat acatttccct ctcttttaga taaaaggtag tatactgtat 63660 

acactattct gcagagtttt tttttttttt gatgtaactc tatcctgagg gtgctctgta 63720 

gcagggacct ctcatgcctt ttaaccactg cctgggtctc cattacatgg ctgcagcata 63780 

gttgccacag cattcctgta ctgatgacta tttggattgt ttccagtctt ttgctattac 63840 

cagtagtgtt acaaagagga tctggctaca tgttcagggt ggggaggggc agatgtgtag 63900 

cctgtcagga gggtattgca gtaatccatg actgagttaa ■ tggtagttta aagctaggat 63960 

gagtcagtgg ggttggagag aagtgggcac atttgaatga tatgtaggag gtgaatgatc 64020 

agcattattg atgagtttga ggtggggcat gtggggaaag gattcgagga tgactcccag 64080 

gtttctgttg ggacagtgga tggatagtgg ctcctcccct ttttccaatc ttccttggcc 64140 

cttcgctgac ttctgttggg ttggcctaca gagagcttct ttttcctctc tgttcgccca 64200 

ggttcctcca ctttggcggt ggccctctgc tcgacggtgc cttcgctggc cctgacatcc 64260 

ctgctgtgcc tgggcttcgc cctctgtgcc tcagtcccca tcctccctct ccagtacctc 64320 

accttcatcc tgcaagtgat cagccgctcc ttcctctatg ggagcaacgc ggccttcctc 64380 

acccttgcgt aagtggcctt ggggcgggct ctgtggagac ggacacactg gggcaaagag 64440 

aagctggagg taaagaaatt gggaggcaag gcggggcctg gaggcagtca ggtgcgggag 64500 

actgggtttg ggggcaggtg tggagggggt gagaccagag gtggtgggaa ggatagaaca 64560 

ttcatgcact tgagccttta catctgcggt gccctctccc tctgttttct acctggtgaa 64620 

ctcgtattca tcctctgagg cccacttctg tttcagttct ccagggaaga aatggaaaag 64680 

tgtcttccct tctttgtgcc cttagtactc tagtcttact tcctttgcta gtgcgtgcat 64740 

tgtctggcat gccatccatt tacatgcctg tcttttcttt cctggtgcag cctgcatgag 64800 

ggtcctgtct gtttttccag ggccccgcat gtgccttctt ctgggttctg tgggtcaaat 64860 

gtctgagcag agctgaagag ggaaaggcca gacaggtgtg gttggagggc aggcctagga 64920 

caggggagct ggggacaagc ggccgacagc ccccagaggc caggcttctg cttggaggga 64980 

gggtccctga agctcactgg aacccctctg gtttctctcc ccagtttccc ttcagagcac 65040 

tttggcaagc tctttgggct ggtgatggcc ttgtcggctg tggtgtctct gctccagttc 65100 

cccatcttca ccctcatcaa aggctccctt cagaatgacc cattttacgt gagtactggg 65160 

aggatgggga tccctggcag gaggcctggg ccttaggcct tggctgcccc aaatctggct 65220 

gtgatggcct gggtatgtag catggtgcag cttcccaaag ggtctgtgtt attcaagtat 65280 

ttggggcaaa agtatttgtg tgtgtgggga aacagacatt ctggagtagg gtggggaatt 65340 

ctcacgaaac ttcaagcaaa atcctgagac ctcaaaggtg tttcctgctt gtggtgagtg 65400 

caggcccacc ctggcctctc ccctaggccc acacagggtt tccacagttg gccccaggga 65460 

caggacctct gtgctttcac ctctgtgtcc ttacacctgg agggatgctc tgaggtcctg 65520 

ctctaggagg tggtcgtgag tctcctgctc tttgcagaaa ctgaggctca aagaggttac 65580 

ttacgtgttc agaggcacca gctaaggagc aaaagtcaac tttgaattct gtgttttgac 65640 

tactgcacag ctctatttgc ctcatttttt atttttaaag cagcaaatct tagaatagga 65700 

gtttaaatcc atcacttgga gaaaagaaag actaaatgtt ttttgttttt gttttggaga 65760 

cacgatcttg ctttgtcacc caggctggag tgcagtggca caatctcggc tcactgcagc 65820 

ctcgatctcc tggactcaag cgatcctctc atctcagcct cctgagtagc tgacactaca 65880 

ggcatgtgcc accatgccaa gcttatttta ttttattttt ttgatagaca ctggggtttc 65940 

gctatgttgc ctgggctggt tttgaattcc tggcctcaag cgatccaccc gtctctgcct 66000 

tccaaaatgc tgtgattaca ggcgtgaacc actgtgcatg gccaaaagag taaacttgaa 66060 

atctgaggcg aatgacttga ttgtgacatc aggtgaccta gtaatcagct gtgtattcta 66120 

gctggtgcct ctaccagctt cccatgtgac cttgaacatg tcattgaatg ctcgctaggc 66180 
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ctctgtttct ttatctgtga aatgggcttg atattcctcc tctaccccaa ccgatagtgc 66240 

agaatgaaaa gtaactgaaa gtccttcctc cagggcacca tagtgtctgg gtgaaaagta 66300 

gaatataaac tcggtagact tctggtccct tcattggtca tggaatggac cagtgcttgc 66360 

ttcattgagc aacagttctg ttgttcagaa ttcctggatt tcacctcact tctgctctcc 66420 

ctgcaggtga atgtgatgtt catgcttgcc attcttctga cattcttcca cccctttctg 66480 

gtatatcggg aatgccgtac ttggaaagaa agtccctctg caattgcata gttcagaagc 66540 

cctcactttt cagccccgag gatggttttg ttcatcttcc accacctttg aggacctcgt 66600 

gtcccaaaag actttgccta tcccagcaaa acacacacac acacacacac acacacaaaa 66660 

taaagacaca caaggacgtc tgcgcagcaa gaaaagaatc tcagttgcca agcagattga 66720 

tatcacacag actcaaagca aaggcatgtg gaacttcttt atttcaaaac agaagtgtct 66780 

ccttgcactt agccttggca gacccttgac tccaggggag atgacctggg ggaggaagtg 66840 

tgtcaactat ttctttaggc ctgtttggct ccgaagccta tatgtgcctg gatcctctgc 66900 

cacgggttaa attttcaggt gaagagtgag gttgtcatgg cctcagctat gcttcctggc 66960 

tctccctcaa gagtgcagcc ttggctagag aactcacagc tctgggaaaa agaggagcag 67020 

acagggttcc ctgggcccag tctcagccca gccactgatg ctggatgacc ttggcctgac 67080 

cctggtctgg tctcagaatc acttttccca tctgtaaaat tgagatgaat tttggtgttg 67140 

aaagttcttc ctggagcaga tgtcctagaa ggttttagga atagtgacag agtcaggcca 67200 

ccccaagggc catgggagcc agctgacctg cttgaccgaa ggatttctga cagactatct 67260 

ttggggatgt tttcaagaag ggatataagt tatttacttt gggcatttaa aagaaaattt 67320 

ctctcgggaa taattttata gaaaaataaa gcttctgtgt ctaaggcaac tactgtttcc 67380 

atctctctag gctttgggcc ggggctgtgt gtgtgtgtgt gtgtgtgttt gtgtgtatgt 67440 

gtatgtttct gaggaggccc taccctggca tgagagggta gggaatctgg ctacacatct 67500 

agtgtggcag ctggacccag aggtggggca ggaaccctga ctatgattca ccccgctggt 67560 

cctgggatgt gggcccagag acttcctccc ccaggaaccc ctctgcttcc tcttcctctc 67620 

cacatcctta actaacttta gcagaaccct actcctcact acacaccccc agctagaagc 67680 

gctggatgga atcagaaatt cctagtttga gtttcaattc tgcccctcag cagctgggca 67740 

agccccttaa ccactctgag tcactagttc cccacctgca' aagtgcagtt aatcatttct 67800 

atctctgatg gcgattgtga gaatgtaaag tcattgcaac tgcctagcac atggtaggag 678 60 

cacatgaggg tttgctcctg tgtttactca tgacccttgg ggaggacggg ggcaaagagg 67920 

gagaagttga gggtgcagga ggagagatgg caggtgggtg ggatgggaga atctggggca 67980 

cacctgctgt ctcattccca ccttgctagg agagggacta ggaaagaaca gtgggaggca 68040 

gggg<?atggg ggtggaaggc agggggtggc aggcaggttc atccatccat tcattcaaca 68100 

aatgtttatt gagcacctgc cacgtgtcag gccctgtcct gggtgctggg gctataaaga 68160 

tgcagaaggg tctgaaaccc agctcttcct tcttcctgtg gatgtcgggg tgtaatttcc 68220 

aggggccagg agcctgggtc tgagggcgga caccaaagtt ctagtggtgt ctattagcag 68280 

cgtttaaatc taatggatgg atttggtctt gttaccctgc tcaaaagctt tcagcagctc 68340 

cccactgtcc acaggacaaa aatccagatg ctagcctggc attcaaggct gtcactagtg 68400 

tgatctcaac ctctcccctt ccctctttac ctcctaccaa cagcggggca gagcccaccc 68460 

ctgtggacca agattcccag tctctgggtc tgtgtgtgca ccagttcctc tgcgtgggtg 68520 

gctcaccctg cctcagcttg tgaaatccat ctggtctgct gggatcctgc tcaaaatgtc 68580 

atcttctcca aaaatcatta ctcaggcttt ccagcatgtc tgagtccctg gcacttggtc 68640 

acacccttcc tggtgactgg catttgcctc cacatcatga ccctcccacc ccttgcctgg 68700 

gcagcatact ccaggaggca aggtctgttc tcgcctggct ctaattaatc tgtgcttacc 68760 

atccacatgg taccagctaa ttcttgttga atgaatgatc gttgaatgag tggattcttg 68820 

ttttggcctc agaaccaatt agaaggagcc agaaaaacac atgggggtgg gggaggtgca 68880 

gtgtggtgca gtggaaaaaa acccttctgg aaatctcagc tctgtcactt actttgtcag 68940 

ctctgtgact ttggatggac cacttctttg tcagtatggt gggagaaata gacatgcctc 69000 

tctgggctgt tgtaaggatt acaaattagg tcgagtgctt ggcatgtggt gggttgaaca 69060 

gatcacagct agcattacag atgatatatt aaagccaaaa aaagatgcct aatgtccacc 69120 

agttggtgaa cggacaaagg aaatgtacca tatttgggat attatttggc aatcaaaaaa 69180 

agtactgaca cctgctacaa cacggatgaa tcttgaaaac attagactaa gtgaaagaag 69240 

ccagacacaa gaaactgcta atgattccat ttaaatatga aatatcgggc cagggtgcag 69300 

tggctcatgc ctgtaatccc agcactttgg gatgccaagg tgggcagatc acttgaggcc 69360 

aggagttcgt gaccagcctg gccaacatgg cgaaaccccg tctctactaa aaattagccg 69420 

agtgtagtgg catgcacctg taatcccagc tacttggttg gctgaggcac aagaattggt 69480 

tgagcctggc aggtggaggt tgcagtgagc caagatcgtg ccactgcact ccagcctgga 69540 

tgacacagtg aggttccgtc tcaaaaaaaa aaaaaaaaaa ggaaaaagaa aaaaagaaat 69600 

ttccagaata ggccaatctg tagaggcaga aagtagattc atgattgggt aggcctgggt 69660 

gtggaggcca tgggtagtga tggctaatgg ggaaggggtt tcttttgggg tgatgaaaat 69720 

gggtggactt atggtatgtt aattatacct caataaaact gttatttaaa ggaagaaaag 69780 

atgcctggat tccccaggaa gtgtacagta gacttctgtg agaatcagaa atgatttctg 69840 

gggaagatgg gcgagaggag agtaagtggg agaagtgacc acgtgcgcaa ctctcatcgt 69900 

tctgccctga gagccttcct cctgcaactt tatttattta tttattttga aacaggttct 69960 

cactctgtta ccctggctgg agtgcagtgg tgtgatctca gctcactgca gcctcgacct 70020 
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gccaggctca 
caccacatct 
cactctgttg 
tcccaaagtg 
cctcaagtgg 
tcagcaacgg 
tggaggtgaa 
ggagaacaca 
ccttttaagc 
aaagtcaggt 
tcccaaggtc 
tcttcttctt 
tcttcttctt 
tcttcttcct 
cttcttcttc 
tcttgctcta 
ctggccttaa 
caccatgccc 
aaataaagag 
tggttaacat 
tgatctcata 
ctgctcacaa 
tccctttccc 
ttggtcagca 
gtgggcttat 
tcactatcca 
aggtgctgct 
gcgagaggga 
gagccttcct 
ctcactctgt 
ctcccaggct 
tgtctaggag 
ggtgtgacat 
catccaggct 
aatgtctgag 
gattttgagg 
tctctgtcgc 
cggttcaagc 
ccacacccgg 
tggtcttgat 
caggcatgag 
tgctcacagg 
gctggctaga 
tgtgacctta 
cagtttgttg 
ctggagatgg 
gattcctggt 
gagttgtaag 
agtgagtgat 
gttattagta 
ctggaatgcc 
tgtctccacc 
gcaaaaaaaa 
atcaatcttc 
tccaagcaca 
gggctgggga 
tgtccttgta 
tcagagagta 
taggtccttc 
tcaagtacgc 
cgctctgtca 
tcaagcaatt 
gccagctaat 
ggtctcaaac 



agcaatcctc 
ggctaatctt 
ctcaggctgg 
ttgggattat 
aaccaatgag 
gcttccaatg 
ttggccccca 
tgccgtgtaa 
ctattcagcc 
tcacccatta 
acattcattt 
cttcttcttc 
cttcttcttc 
cttcttttct 
ttcttcctct 
ttgcccaggt 
gcaatccccc 
agcctgcatt 
ggacatgaaa 
gttggtatgt 
agcttttcct 
ccccatctcg 
cttccctgcc 
gattggagag 
cacaaaagag 
taatctggtg 
aagaacactt 
gtaagcagga 
cctgccacgt 
cacccaggct 
caagcagtcc 
ccctttttcc 
gggcacagca 
tccaggcata 
tgtatcttaa 
tgttttgttt 
caggctggag 
gattctcctg 
ctaatttttg 
ctcttgacct 
ccaccgtggc 
cccctcagca 
gcagtagggt 
accaccctga 
caaggactag 
actgtccaag 
acctccgctg 
gattgagtgg 
caatataaac 
acatctgtgt 
actattatag 
gaacaaaaca 
aaaaagactt 
atattcaatt 
cagaaatgtc 
agagtgcagg 
ggagtcatga 
ataaggataa 
attaatggca 
tacatgacag 
cccaggctgg 
ctcctgtctc 
tttttttggt 
tcctgacctt 



ctgtttgagc tcctgagtag 
ttatttattt atttatttat 
tgtcaaacgc ctggactcaa 
gggtgtgagc cactgtacct 
gaagcaagca actcagagct 
caacccttct ctcctgtaac 
tcacaccccc acagtgccaa 
gggacagcca acagcataaa 
cagggaagtg catatgatca 
gcacattgtg gggctggtat 
agacattacc tttactttac 
ttcttcttct tcttcttctt 
ttcttcttct tcctcttctt 
tcttcctctt cttcctcttc 
tcttctttct tcttcttctt 



tgaatgcagc 
tgcctcggcc 
tattctcttg 
aagacgcatc 
tccctcccgt 
tatgtaatca 
tggttctgcc 
tgtgtggggg 
tgcaaataat 
caccttggga 
ctaactgtac 
ctattctgtg 
gaaacaaccc 
ggttttgttt 
ggagtgtagt 
tccccaaatt 
tgccagcccc 
gtgagtcatt 
agatccagtc 
acccaggacc 
gtttgtttgt 
tgcagtggca 
tctcagcctc 
tatttttaat 
cgtgatcctc 
ccgcccaatt 
tattctttcc 
gtgtgggaaa 
acctcagttt 
agagtaacct 
accaacttcc 
cttcatctgt 
ctggatgaac 
cacctggctt 
gcccttcaga 
ttcaggatag 
acacaagttc 
tcgagttacc 
tttgcgatac 
aattaccagg 
cagacagatc 
agcatcttag 
gtaaggctct 
gctactaatt 
tgaaacagtt 
agtgcagtgg 
agcctcccga 
atttttagta 
gtgatctgcc 



atcatcatac 
tccaaaagtg 
taagaaagat 
accattaatt 
catttgactg 
aatagtagcc 
acgccttgtt 
tcctagatga 
aaaaacactc 
tattccagag 
tttagctgaa 
agaatcagag 
acaggcacag 
gtttgtttgt 
ggcaagatca 
caaagcttgg 
tcaggggatt 
cctctgacat 
ctctggtgat 
atggcttgtg 
ttgtttgttt 
cgatctcagc 
ctgagtagct 
agagactgtg 
ccgcctcggc 
ttgagttttt 
tgggtccagt 
gctctgggct 
cctcacctgt 
tgggaataaa 
agttcctccc 
aaaatggagt 
gtcaagcact 
gtagcaggtg 
gcgtgcacca 
aaaacctccc 
tttattactt 
ccaaatctta 
ttttgtctcc 
ggcagggcta 
tgggtcctgt 
tgcctttgtt 
acgtgaaaag 
tttattacat 
tttttgtttg 
cacgatcttg 
gtagctggga 
gagacggggt 
caactcagac 



ctgggactac 
ttatagagat 
gtgatcctcc 
ggcacctcct 
ttcacaagtt 
cagcctcagt 
gctgggccct 
ataggaattg 
gccccatttc 
ttaaaccagg 
atttcttctt 
cttcttcttc 
cctcttcttc 
ttcctcttct 
cttttttttt 
ctaaatgcag 
ccaagatttc 
atcatttaaa 
ggaccactca 
gatgtatgtg 
aaaaacatga 
tatccccatc 
cggtgagcca 
agaaggcgag 
aatgacctca 
ggtgctggca 
atgatttcta 
ctctcatctt 
ttgtttgttt 
tggctcactg 
agtgatggtc 
gatgactctc 
tctttgggaa 
aaggagttca 
ttcacaccag 
gttttttgag 
tcactgcaac 
gggactacag 
tttcaccatg 
ctcccaaaat 
atgttctaat 
gtcacctccc 
ttgcaggcac 
aatggaaata 
aggtagcagc 
cacacaagct 
aacaatagga 
tcaaagggga 
tgctgtgtgt 
cacttcacac 
tgcaagcact 
ggaatgggaa 
agccaaagtc 
ccagcagtca 
tgaattcctt 
tatcacgttc 
tgctacctat 
tgctcggccc 
acgcaaaatc 
tttgttttga 
gctcaccgca 
ttacaggcat 
ttcaccatat 
tcccaaagtg 



aggcgcatgc 
tggggagtct 
caccttggcc 
gcaacttctt 
ttgatttcaa 
agagaggaac 
tccatcaggg 
tgtgatgatc 
atagatgaag 
tctgtctggc 
cttttcttct 
ttcttcttct 
ctcttcttcc 
tcttcttctt 
tgaggtgggg 
ccttgaactc 
aggcatgagc 
acagacgaga 
gagataatca 
ataatttaaa 
ttttaaatgg 
caccccctac 
gagggcagcc 
ctgttgtcaa 
tacccgctaa 
ggtcctgccc 
gggaaaatgg 
tctgccctga 
cagatagggt 
aagcctcgac 
ccagtggtta 
aaatgcttca 
gaacattttc 
cagacaggac 
accctccagg 
acagagtctc 
cttcgcctcc 
gtgtgcacca 
ttggacagga 
actgggatta 
cccaaacatc 
aggcctgcag 
tgatcagctg 
ggtaccacgg 
agcttgggct 
ctggcactta 
atactttata 
cctggcatgt 
ggctgcaggt 
cttgtggagt 
cgctttagct 
aacttcaaag 
aatgaaaaat 
atggagagaa 
tcagagccct 
ttagattggg 
aatgcctacc 
tggcacatag 
acattacagg 
gacagagtct 
acttctgcct 
gtgccaccac 
tggccagact 
ctgggattac 



70080 
70140 
70200 
70260 
70320 
70380 
70440 
70500 
70560 
70620 
70680 
70740 
70800 
70860 
70920 
70980 
71040 
71100 
71160 
71220 
71280 
71340 
71400 
71460 
71520 
71580 
71640 
71700 
71760 
71820 
71880 
71940 
72000 
72060 
72120 
72180 
72240 
72300 
72360 
72420 
72480 
72540 
72600 
72660 
72720 
72780 
72840 
72900 
72960 
73020 
73080 
73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 
73860 
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tggcatgagc caccgcacct ggctgtgaaa cagttttatt gtgtttctgt ggaatgtgtc 73920 

ctacccaacc tatagctaac tcctatagtt ccctcagttc tcagctcaga tatcccttcc 73980 

tttctgtact gttacctagt actggttttc atagcaccag gtacctctct ggcatagagc 74040 

ttgtcacagt tgcagtttaa tgtaccatca taggatttta aaaatattca gttgtgtctt 74100 

ccattaggct ttcatttggg aactccacgc aggcagcagc tgtatatttt gtattgccta 74160 

ctgtatcctg agaactttgt accctactta gcacagaatg gaggctcagt aaatactgga 74220 

catgagagag agagagagag agagaggaga gggagagaga gagagagaga ttcaacctac 74280 

aatcccagct ctgagcttct agttccctga tggtgaggac tgtgatgtgt ctcacacggt 74340 

aatgagcact tatgcagaag aggctcagaa aatttctcct catggccaac ggaagactta 74400 

gagttctttt ccaagctcca ccgtttgctg gcatgcaaaa tttggactat cacttaagtt 74460 

ttccaagcct tgctttttct atccctaaca taggacaata ttcagcattg ttgtttgttt 74520 

gttgggggca ccatgtttca ggcacttagt agattattgt accaccacat ttcaattggt 74580 

cctcctcaag ccctgcaaca tctgtgaggt ggtcatcctt aacaactcac agatgagcaa 74640 

caggagactg gggggatgag ggaactgcca aggaggtcca gcttatgggc agcagagcca 74700 

agaatggaac cagggtcttt tattttttta tttttttatt tttatttttt aaccagggtc 74760 

ttttaacatc cgaggaccac attctttgtg ctttccaaat catcacctgc cccatgcaac 74820 

ttacagggta agttacatta aacaacgtat gtaaatggct ttgtgctagt tattcaccac 74880 

cacaggggaa gtgagtcacg gacaagagtg cagccgctcc attcggatcc tggctctgac 74940 

acttacctgg aaaatgactt aaccattccc aggatcagct gtttgtctgt aatttaggta 75000 

gtttaatggc acttgtgtcc tagagttgtt tagaaggttg aataatatgg agcacttaac 75060 

atacttagca cctagaaaca cttcctaaat attagttgct gctgttgtta tcgttattaa 75120 

aatttctgcc taagatctca tttcagggag cccaactcaa tctttgacaa gcttaaacaa 75180 

aaattgcttt tcttcattta ttcacttaca cagcaaacat gaattgagcc tgtactgtgt 75240 

ttccagaact gtgcaggacc agagaggcac aggtgaagga agcaaggctc tggctctact 75300 

ggggaaacag caagaagatt gctacaatga ggtgggaaga gggctggact agagagaagc 75360 

cctgattagt gtccttgcta cctttctctg ggagagccaa ggcaggcttc ctggaagagg 75420 

tgatccttgg ctgaaacttc gatgaagaaa aggaaagagc gcagtggtta gggaggaaag 75480 

ggcattctgg gcagatgaaa tgacatgtga caaaatatgg gtgatcannn nnnnnnnnnn 75540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn' nnnnnnnnnn nnnnnnnnnn 75600 

nnnnnnnnnn nnnnnnnnnn nnnnnnngca ctccagcctg ggtcactgag tgagagaccc 75660 

tgtctcaaaa aaattaaaaa aaaaaagtcc agaagaacat ttgggtctca ctctgtggcc 75720 

caggctggag tatagtggca caatcatagc tcactgcaca ttcaaactcc tggcctcaag 757-80 

tgatcctcct gccttagcct tgaaataagc ttggattaca gatgagccac cacacccagc 75840 

cagaccatta ttcataatag ccaaaatgtg aaaacaaccc aaatctccat caactgacaa 75900 

atggataaat agaatggtgg ttgatccata caatggagta tttactcagc aataaaaaga 75960 

agtcctgata catgctacaa ggatgaacct cgaaaacatt atgctaagtg aaagcagcca 76020 

atcacaaaag gctacatatt acaagattcc atttaaatga aatgttcaga ataggtaaat 76080 

ctaactttta tcacaggcaa agctatgaca ggaaatagat gagtggttgc ctagtgcttg 76140 

ggggcagagg tgggggtgag gcgagtgagt actgctaatg gtacagagtt acttttgggg 76200 

ataaagaaac tgttctgaaa tggactctgg tgatggttgc actactctga acatactaaa 76260 

actgttaaat tatatacttg aaatgggtga cttgtgaggc atggaaatta tatcttaata 76320 

aagctgtttt acatatttta catatttaaa aatgcaggtg gagggatgag ccctctaaag 76380 

agaagcagga gtttgaggag gttctaaata ttgtgtggtg ggtactgagg catataaatt 76440 

tgtcagacct catcaaaatg tatgatgtaa tcttcaagaa agttgatttt aaagaaacac 76500 

caccagcacc aggtggagaa ggcaggaaga agttacacaa ggggtaggcc aagagtggtg 76560 

gctcatgtct ataatcccag cactgcggga ggccgagctg ggtgggtgac ttgaggtcag 76620 

gtgttcgaga ccagcctggc caacatggtg aaagcccgtc tctactaaaa atacaaaaat 76680 

tagccagacg tgctcgcgtg aacccagggg gagaaggttg cagtgagcga agatcatgcc 76740 

aatgcactcc agcctgggtg acagagtgag actctgtctc aaaaaaaaaa aaaaaagtta 76800 

cataggggac agtggcaggt gtcaagggca ggcagggtct ctcctatctc caggataaac 76860 

tcatagggga cttagatgcc atgtgggtcc ctaatagccc tccacttggt tcttgcagcc 76920 

actcttatgt gtatcatttc atgtcaggcc tcttcttccc aacccaccca gccatcccag 76980 

cctggctgcc aaccccacct cctccagccc ctgtcacccc ataattgggg ccaggaggca 77040 

tgggagagtc gccatctctc ggtgccatct gttgcatctt tacagataac catggctgga 77100 

tgcggcagat cctggggtgg agcagccgct gttcagagca gtgatcaaga cctccccatc 77160 

tccacccctc aaggaatcgg ttttcttcca tagccacatc aggtgctgtg caggaaggag 77220 

ttgaaacgag aagccaggag caacgagaag gacactaaca tttattaagc actgcagact 77280 

ctcacagcac tcccacggaa tcgatattat tatccccatt ctgaagacca ggcaactgaa 77340 

gctcaatgtt taaggaactc accgaagtca ccaactgata aaagtgatgg aagctgggat 77400 
tcaaatccaa gctaaacttc cttccaagct tactccacaa cacagaggtt ggggaaaggg . 77460 

gataaaaaga gaggggagcc caattccatt tccacccagc tcctgaggcg gagcttgtca 77520 

gcacagctct ctccttccca gaataggaag atacccatca gaggcaagtc ctagacacca 77580 

gcagtggtaa ctccctgccc caaggcagct gcagacagcc tatggctgta gttactgctc 77640 

ccaaagagtg ttagaattcc cactcccagc ttcggggcca ctcacacaag gtgattgaag 77700 



46 



WO 02/29059 



PCT/US01/31488 



tggaaaccag agactctcca caatgccctc ctagagtaaa tgaggctatg taactttgtc 77760 

caaatgagta atttgaaaac ctgggggctc ccagctcctg aaaagggaag gatgtggggc 77820 

cctttatatt catactccac tttgtgcagc tctcccttgt cttatgatag ccctattaag 77880 

aaattcctct cccagcacgt ctccttcaaa gagctctaga cctgaggctg tcagaggctt 77940 

aggactctgc ctattagtcc cagggtctgg atgaccagca ggacacctgg cattcagtga 78000 

ccactggatt agataaatga aacagtgggc agagtgccac ccaatctccc cctgaagttt 78060 

gaagaggtcg agaagtgagg ctgtccaact gctgaccctg ctttctgtcc acctggccac 78120 

ctaacctttt ctggcttcca cctgcccctt tgccatccct ccccccagcc cacccagccc 78180 

attttcaggc atacctgggc acgtgctgga atagaagccc tcgttcttca gaatgatcaa 78240 

cagggagccc cagcccagga gtacagcaga gaagaagagg ttctccagca cagccgtgca 78300 

ggccatccac cagcgcctcc ggtacgcctg ttgcagcgtg ggggccatgc tggccccgag 78360 

cctgcacaga aacagagcgc tgggtgaagg gccccccagt ggccccaggg aagggtcctg 78420 

catcatggtg gcacccgaga cctctcgggc cagcccgcga ggagcccctc atggaggccc 78480 

catagagccc tgggcttccc agccggtgcc aaggagctgg ctccgcgcgc actagcagtg 78540 

ccagaggtgc acgcggcacg gggctcccgc tgagccacta tcggaaacaa ggaaggtcct 78600 

gtctgcgcgc tgcagcttcc tagcaggctg ccgggttctc tcacccaggc cagggcgctc 78660 

agggccgggc tgctggggag aaagtccgca tctgcccagg tccccagagg acagcaaggg 78720 

gcagagcgcg ctctgaagca ccgcgggccc atgtccggac tctcgcgcca ggaaagaccc 78780 

ctagaagctg gcaggaagaa gggcaagttc aaggctaccc tacgacccca tcttccagtt 7 8840 

gcccctccaa gacctctcct tccctctggg gccgggcgac agcaagccct ccccctttcc 78900 

gtatcaggtg acccacgacc ctacagtctc tcgggccaag ccaacagctg ccacgtggag 78960 

ggagacccag gacgggctct cctcggttcc ctcctccccc gcgcgcccct cactcactcc 79020 

gcagggctcg gggcaccagg ctttgcacct cggaacccgc ttgcccccct ccagccccgg 79080 

gagggggctc ggacttcggc aggaagtctg gcggctgctg actttataag ggcagcggtg 79140 

gcggatgggc tggcgggcgg gtgtgtttac caaagggagg gaaagagccc cagctccccc 79200 

cgccgcggcc gctgcagcct cggcgggagg agagggaacg cgggcagcgc gggggcgggg 79260 

agcgacaact gggatgagac cgaggaaagc ggagaggaga- agggcaagaa agacccagag 79320 

agaggggagg aagtaccagt cacttcttcc agggggactc ggtattctca tctgtgaaac 79380 

ggggctttgg gttcaagcgc tccaggaggt ccgctggaac tctggcaaac gcgcagctct 79440 

aagcagagga agtgcagcga gcggggaccc gggaggaaga gaagagtcgg aggggtcaga 79500 

gaaaagaaaa gggaaggacg cgcttggcga gatgggacac tgtgccgcgg gaccgcgggc 79560 

gcaagtaacg gtctttcctt gggaagcctg gcagtgtcgg cgggagccgg cctcggtgtc 79620 

tctcagccga cgcatagccg gagaccctac gcgcgccccc tccccgccca cgctgctcac 79680 

ctccggtcac cggcaaatga gcagccagca gctgcggacg cctccgggag cgcaacgctt 79740 

tcgcggcgcg tccggagtcc cgtgggccca gccctgagcc gcgccggcgc tggggtcttc 79800 

tctgcgtgca ggacccggcc gccacggagc ttcagcctga cagcccggtg gcctcgcctc 79860 

cgctgtctcc tcggaagaag cgggggaact gggaacccgc cgggcgccag aggtctgcga 79920 

agctgggctt ggatgaagtg gatctgcgga gttgatagtt gtatttacac gcgtccggag 79980 

ctgcgccccg aggtgggggc gggggctccc ttcttttccc ctccccttag gtcgagtttc 80040 

acgcgcacgt gactcgcccg ctggtcccgg acactctccc tctggcacag ccccagcacc 80100 

tacatttcca ccctggaccc ccatcttctc ccccaagccc ccagactaac atcaggcagc 80160 

gccctctgta tccttgttca aaacaaagtg cgattcggct gaagccgact gaccgcgatt 80220 

cagggccgcc ttgggtgggg ttttgaactg tgcagctgga agcagtgttt tccgagaggc 80280 

agagtggcac gggtttcttt ggagttagtc agatcgaggt ctgagtcttg actttttaac 80340 

tgactaccct gggttaccta gggcaagtta cctctctgag cctcagcttc ctcctcttta 80400 

aattcggtta aaatggaacc tacctaactg cccaaaggaa tcgcgattgt gatgcaggta 80460 

aaatgctaag catagcattt ggcatagtaa gcataatgtt aattgttgct gctgtcatta 80520 

tttcagaaga cctggtgatc ggatgcttcc agatcaacaa ttgattgact ccaggtaaat 80580 

ctctcagcct ccctgagcct cagtatcctc atctgtaaaa tagactacta tggtgtggag 80640 

taatgagaag taatctcatt acatgtgagt ttaattgtgt gttaagagtg ctgctaatgc 80700 

atgctgagct taatacctag gtgatgggtt gataggtgca ataaaccacc atggcataca 80760 

tttacctacg taacaaacct gcacattctg cacatgtacc ccagaactta aaataaaaat 80820 

aaaatttttt taaaaaaaga gtgatactgg tggccaggtg tggtggttca tgcctgtaat 80880 

cccagaactt tgggaggcca aggcaggagg atcg.cttgag ctcaggagtt cgagaccaac 80940 

ctggacaaca tggtgaaacc ccgtctctac aaaaaagaaa aaaaaatagc caggcatggt 81000 

ggtgtgcacc tgcagtctca gctacccagc aggctgaagt gggaggatca ctgagctgga 81060 

gagatggagg ctgcagtgag ccaagatcat gccactacac tccagcctgg gtgacagagt 81120 

aagactctgt ctcaaaaaca aaacaagaat gactacagaa agctccaaga aggcctcaga 81180 

taaaagggaa cccctgaaca gatgagccac caagccaaga gaggaactaa tggctaccat 81240 

agacagggca ctttccaaaa taaaaatact gttattaatt cctcaagaca tcatggtccc 81300 

atttaaacct catagctttt cacagaggga gaaactgcag gcttgaagct ggagcaaggt 81360 

tagaggtagg atgcagagtc aggtcggcct ggcatttaag tacggctcct tccattcctc 81420 

ccagaaggag aatggcaaga gcaaaggctt agctgtggga atggcacaag gagttctcgg 81480 

tggccaaagc acatgtcagg ctctgatggt ttaacttctt aaaatgcaat actgcctccc 81540 



47 



WO 02/29059 



PCT7US01/31488 



agaacttcca gatcaaggtc aaactcctca gctctacaca gggggaccta gagtcaactt 81600 

tctaagctag gagagtcatg gatccctttg agaatacaaa agacagtggg cgcggtggca 81660 

gtggctcatg cctgtaatcc caacattttg ggaggctgag gcaggaggat cacttgagcc 81720 

caggagttca agacctgctt ggtcaacata gtgagacccc tatttctaca aaaaattcag 81780 

ctgagcatgg tggcatgtgc ctgtagtctc agttactggg gaggctgaag taggatgatc 81840 

cctgagcctg ggaggtccag gaagctggag tgagccgaca tctcgccact gcactccagc 81900 

ctgggtgaca gagaccctgt ctcaaaaaaa aaaaaaaaaa gaagaaatat gttattgatc 81960 

tactcttgac aaaaatgctt gtgtgaacat ggacacacac actcatcaac attcacattt 82020 

caaggttttc atggaccctt tccatgaggc tctagtggtc catggacccc catggctgga 82080 

acacttgctc ttcctcatct caacccacat ttccatggag ttggactgtc tgctgcatga 82140 

ggacacaggc ctcatttggt gtgttcattc actgctgtgt atcccagcac ccagaacagc 82200 

acctcaccta aggggcactc agcacatgtg cagtgaagag tcagtcagct ggtttcacac 82260 

ctcccagtct ttgcacctgc tattccttct tgtgggaatg acagatttcc ttcatttctt 82320 

tttttttttt ttttgacaga ttccagctct gttgcccgag ttggagtaca gtggcacgat 82380 

ctcagctcac tgcaacctct gcctcccagg ttcaagcaat tctcatgcct cagcctccca 82440 

agtagctggg attacaggtg cacaccacca cctgtgagct gatatttttt tcttttcttt 82500 

tcttttttcc tgagacagag tctcactctg ttgcccaggc tggagtgcag tggcgtgatc 82560 

tcggctcact gcaagctcca cctcccgggt tcaagtgatt ctcctgcctc agcctcccaa 82620 

gtagctgaga ctacaggcgc gcaccaccat gcctggctaa tttttgtatt ttttagtaga 82680 

ggcggggttt caccatattg gacaggctgg tctcgaactc ctgacctcgt gatccgccca 82740 

cgttggcctc ccaaggtgct gagattacag gtgtgagcca ctgcactcgg ccattttttg 82800 

tattttttta gtagagatgg ggtttcacca tgttggccag gctggtcttg aactcttggc 82860 

ctcacgtgat ccacccacct tggccaccca aagtgttggg attacaggca tgaaccactg 82920 

cgctcagcct ccttcttcat ttctaatgta ctcatccttc acaactcagc tcaagtttca 82980 

cttctctctg gaagctctac tctaggctgg attcagggcc ttgtccacat acccaccaaa 83040 

tactctgctt acctctatgg aagtccccac actgatctag aataatcagc ttagttttct 83100 

gcccccatcc cgccccatga gatgtacatc ttgtgggggc ■ aggaaccacc acgtggtagg 83160 

tgatttgtgt gcctgctgcc tatcacaggg cctggcgcct' aataagcttg cggccaacat 83220 

ttgttgaata aatgaaaagg gaatggtggg aaaggaagct * gaaaaggtag gctaaaatca 83280 

gtttggaatt acctctggga ggccaaggac tttcagtctt gcagggtagg taacaggaaa 83340 

ctcctggatt ttgttttctt ttggttttgt ttgtttttaa tgaagggtag cgttatcgtc 83400 

aggtttttgt gtttaattaa tggagcatat attggaaagg acagagacct taaagcagtt 83460 

aggagaccac cataatagtt cacattttgc agccataaaa aggaatgagg ccaggcatgg 83520 

tggctcactc ctgtaatctt atcacttcgg gaggttgagg caggcggatc acctgaggtc 83580 

aggagtttga gaccagcctc accaacatgg agaaacccca tctctactaa aaatacaaaa 83640 

ttatccaggc gtggtggtac atgcctgtaa tcccagctac tcaggaggct gaggcaggag 83700 

aatagcttga atctgggagg cagaggttgc ggtgagccga gatcgtgcca ttgcattgca 83760 

ggtacatgga tgaagctgga agccatcatc ctcagcaaac taacacagga acagaaaacc 83820 

aaacaccgca tgttctcact cataagtagg agctgaacat tgaaaacaca tggacacaga 83880 

ggggaacatc acacactagg gcccgttggg gagtgggggt tggggggtaa ggggagggaa 83940 

cttagaggac gggacaatag gtgcagcaaa ccaccatgac acacgtatac atatgtgaca 84000 

aacctgcaca ttctgcacat ggatcctgtt ttgttttaag aagaaataaa gaaaaaacca 84060 

agaagaaaca aacaaacaaa aataattccc atttaaaaca ataaaaaata ggccaggcat 84120 

ggtgactcag gtctataatc ccaacacttt gggaggccaa cgcgggcaga tctcttgagc 84180 

ccaggagttc aaggccagcc tgggcaacat ggcaaaaccc tgtctctaca aaaaatataa 84240 

aacaaacaaa caaaatagcc aggagtggtg gtgcatgcct gtcatcccag ctactcaggt 84300 

ggctgaggtg ggagaatcac ttaagcctgg gaggcggagg tagcagtgag ctgagatcgt 84360 

gccactgcac tccacctgga gcaacagagc aagattttgt ctctaaataa ataaataaaa 84420 

taataaaaaa cagagaagag gaaagacacc tgagatatat ttccatatct gaatcaatag 84480 

gatttatcaa cgttctcctc tacccccaaa actaattcct tcctaaactc tgttctcctg 84540 

acactactca taggttaagt ataacagcat tatcacattg gctgtcatgt gggctcctgcf 84600 

ctagaggctg cttcacagct taatggacaa gagcactgag acagggtggg tctaaatcct 84660 

ggctctgcag ctgattattt gtgtgatttt gtccaaatca ctccatctca tgagcctcac 84720 

tcttctagtc tgttaagtgc tgaaaataaa agtatccaat tcaattcatt atttaatgaa 84780 

ttatttagcc taacaaatag ctattataaa tatttaggct gggcacagtg gctcacgcct 84840 

gtaatcccag cactttggga ggccaaggtg ggcagatcac ctgagtcagg agtttgagac 84900 

cagcctgacc aacatggtga aaccccgtct ctactaaaaa tacaaaaatt agctgggtgt 84960 

ggtggcatgt gcctgtaatc ccagctactc aggaggctga ggcaggagaa cgcttgaacc 85020 

caggagacag aggctgcagt gagccaagat cgtgccactg cactctagcc tgagcaacag 85080 

agcaagactc tgtctcaaaa aaaaaaaaaa aatctctgca tgaagaatgt acataaaatg 85140 

gtgcagccat ttcggaaaac agtttggcag gtcctcaaat agttaaacat agagttacca 85200 

ctatagccca gcaattccac tcctaaatat actacaccca agagaattga gaatatttgt 85260 

taacacaaaa atgtgtatac aagtatttat agctgtatta ttcattacag ctaaaaagtg 85320 

caaacatccc agcagtccat cagctgatga acggagaaac aaaatgtggt atacccatac 85380 
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aatgtcatat tatttggcca taaaaaggaa gtactgatac atgctacaac atggatgaac 85440 

cttgataatg ttattctaag tgaaagaaac cagacacaaa agaccacata ttgtatgact 85500 

gcatttatat gaagtgccca gaataggcaa atccacagag acagaaagta gattagtggt 85560 

tgccagagac tggagggagg agataatggg aaatgtggaa tgactgctaa tggtatgggg 85620 

tttcttcttg gggtaatgaa aatgttgtac aattagataa tggtgatcat tgtaaaactt 85680 

tgtgaatata caacatgctg aattttatac tttattatat tttatttttt ttgagacaag 85740 

gtctcgctct gtcacccagg ctggagtgca gtggcacgat ctcagctcac tgcaatctct 85800 

ctgcctccca ggctcaagca atcctcctgc ctcagcctcc tgagtacctg acactacagc 85860 

atgtgctacc atgcctggac aatttttgca tttttagtag agacagggtt tcgctatgtt 85920 

gcccaggctg attttgaact cctggactca agtgctccgc ccacctcagc ctcccaaagt 85980 

gctaggatta caggtgtaag ccaccactcc cggcctaaat tgtattcttt aaaagactga 86040 

attgtatggt gtgcgaatta tatctcaatt taaaaaaaac aaaacaaaac aaaaaaaaaa 86100 

cctttgcgtg tgtcaggcac tagggattcg atgctgaata agacacagac cctaccctca 86160 

gagaacacag agcccagcag gagagagtca cagatgaatc aagtgttaca tcatctatag 86220 

gaagcgccat ggaagaaaga catggtgcca tgagaacata cgcttagaga agggaatttc 86280 

atctagactg gggctcaggg aggaatcttt cagggtgatg cttgtgctca gagttttcca 86340 

tgtcagaatc agtagaattt atcaatcctc cagaggagga aacagcaaat gaaaaatctt 86400 

acaacaggag gatgcggaga cattccgaga gctgatcaag ggctggtgtg aacaaagcac 86460 

ataggatgca gagcctgtgg tgtgaggttg cagctggaaa ggtaaaacac taattacatt 86520 

ggatcttctg agacaataaa gagtatgcaa taatctcaaa cgaccgaaac tgaccttcct 86580 

cctccctaac ttgcttgctt ccactgttgc ccgtatcata aaagcaccac cctcttctac 86640 

ccagtggctt aagacacgaa actcaagtca tcccaggctt tctccccacc tcactctcca 86700 

catccagcct atcagcgagc ttgtgggtct taccacgtaa agacttctca tctccagcta 86760 

ctaccatccc ccaagcccag atcaccatca gctcaggcct ggactcctgc aacctttcta 86820 

accgggtctt cccaatccta cccccgcaac atgaccccaa tagcccatca gaatggacta 86880 

atcgagatgt agatttgatc aggccacatc ccttgaaagg cttcctgtga ccctcgggga 86940 

aatgcacaaa ctcccaatga tggcccctga gtcctgtgcc atctgggtct gccctctgcc 87000 

ctctgtgtct ttgccatggt aacctccttc acacccatta atactccatg ctctctccta 87060 

cctcaagttc ttcctgggct ggaacattct ctgcactagc ctagccaact aaccctttag 87120 

atcttttgtt tgtttgtttg tttgtttgtt tgtttttgag acagtcttgc tctgttgcca 87180 

ggctggagtg caatggtgca atctatctcg gctcactgca acctctgcct gccgggttca 87240 

agcaattctt ctgccttagc gtcctgagta gctgagacta taggcaccta ccatcacgcc 87300 

cggctaattt ttgtattttc agtggaggtg ggttttcacc atgttggcca ggctggtctc 87360 

gaactcctgg cctcaaatga ccaccctcct cggcctccta aagtgctggg attacaagca 87420 

tgagccactg tgcccaggca acacttcaga tcttaatgat catttccttt aagtgcctga 87480 

cctcttgtag taactagcct gactccagca atgaatcctt ttgcaatgta acctatataa 87540 

catctgagtt tccctttgat aaaactcatc atatatttgt tcctctgaca gttcagaggg 87600 

caagggcctt tgcccacctt cctcaccact atcctctcac cacttaacac agaactcacc 87660 

acccaccatg cctcctgcct gacaaattcc taaccatcct tcaaatctca ctcacctatt 87720 

accttctggg aggcagtctt ccctgagcac caagacaatg ggacacattc ctttatacac 87780 

cctgctgaac atctcttttt tgaggggcgg gtagagatga gtgtctcact atgctgccca 87840 

ggctgacctc aaactcctgg cctcaagcga tcctcctgcc ttggcctccc aaaatgctgg 87900 

gattacaggc atgagccact gtacctgacc gcaactgggt tagnnnnnnn nnnnnnnnnn 87960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88020 

nnnnnnnnnn nnnnnnnnnn nnncctgggc aacaaagacc cttcctctac aaaaaaaaaa 88080 

aaaaaaaaaa aaaaaaaaaa aaattatttt aaattagcca gacatggtag tgcatgcctg 88140 

tagtccaagc tacttgggag gctgaaggga gaggatcact tgagcccagg aggttgaggc 88200 

tgcagtgagc cgtgatcgta ccactgtact ccagcttggg caacagagtg agacctcatc 88260 

cctaaaaata aagaagaaaa tatggcaatt tgactgtaca tctctaatgg gatatatcct 88320 

aaggatgaga aaggaataag gaaggacaga aaaaaggaaa caaagaagta gcaacagtat 88380 

ttagcaattg tattgttatc aagtaacatc aatattggta aaaccagtaa ttatatttaa 88440 

aatactatat atgtgtatgt acatttacat atgcatatgt taggaaccaa gtttatcaga 88500 

ggaagagaaa gggctacaaa tgtaaaatca aggaaataaa aatttgaata aaaatatcag 88560 

tattaagtat ttatgatatt tttcttataa aaaaattata tatatgttaa ctctatccaa 88620 

aacccaaaag cagtgacaac ccaggagcaa taaaaaacct cagcatccag actgtagtct 88680 

ctaccatttc caattaaaga aacccagggc tagttgggaa aaatgacaat ttcatgtcta 88740 

gggcaagaaa cacacctagt gaaatggacc tgaacattta attgtgttag aaagtaagga 88800 

aactctctag aaataatgtg atttcatcta aaagacacag attctgggct ggtaaagttt 88860 

tcaatggcca aaggtgagac aatttgagca tcaagaagaa tcatgacaga acagattaaa 88920 

acatgtcaaa tatattttaa aatgaaatat tataaaagaa acaattagta gccatccctg 88980 

aaggtcacta gggcaccaac tcatatttca aactggtaaa taaatgtgta agccaagcat 89040 

ttatttctgg gtaacaaaat agtaaggaat gtttttcttt ctagaagaat tctagtgatt 89100 

aaaagtagaa gatagaaata gaaaatcatc cttttggcca ggggcagtgg ctgtaatccc 89160 

agaactttgg gaggccaagg caggtggatc actggagatc aggagtttga gaccagcctg 89220 
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gccaacatgg tgaaacccca tctctactaa aaatacaaaa attagccagg tagtgggtgc 89280 

ctgtaatgcc agctactcgg gatgctgagg caggagaatc gcttgaacct gggaggcgga 89340 

ggtcgcagta ggctgagatt acgccactgc actccagcct aggcaacaca gtgagactct 89400 

gtctcaaaaa aaaaaaaaga aagaaaagaa aatcatcctt ttgcgatcct aatgaaataa 89460 

tgggcctagg cattgatcat taatggctcc taaaatcact aagtatatgg ttgatgggaa 89520 

actttatagt ggatggatca gactcgcaat gtctaaacca gttgatcaat cttaacatcg 89580 

taacaagaca acagacacca ggggctgctg acaggagaac agaggaaacc catagctcta 89640 

ccactgagtt attcacggca aaaaaaaaaa aaaaaaaatt aaactgcgtt tcctccaagc 89700 

ttctaatcct gttgtttaca ggaaataccc aaggaaagga atacttttaa atgacacatt 89760 

aaaacaacgc caaatccaaa atatggggaa atgacccagt ttcttcaaca aataaacaag 89820 

aaaaggtagg ggggaggact gttctagatt ttaaaagcta tagaagacac agcaaccaaa 89880 

tacactgcat ggaccaggca tggatcctaa ttggaacaaa ccaactgtaa aaggatgtat 89940 

ttgaaatgat tggaggaatt tgaacagtga ctgcacagta gatgatatga agaaattatt 90000 

gttatttttt aggtgtgatc atgattttat ggtgatgttt aagtaaaaga ggccttattt 90060 

gttagagata catgtacggg tatagagaaa tatttacgga tgaaatgata cgatgtctga 90120 

gatttgcttt gaaaactcta gcaggtgtgg gagaagcagg tgcatgggtg ggggaaggga 90180 

tagatgaaat aagtatgcaa aatgttagtc tacttttgtc cctcctgacc cagcaggtta 90240 

aaatacctca gcatacctct actcctccaa ccaggtccaa ggatcaggcc aaaactccct 90300 

gatgtggtaa acagcctgac cccttcttac ctctctctct ccagccactt ccctaagatt 90360 

ccccagtgct ctgtgcccta gccagcccga ctcatctgcc cagattcctc aatgtttcac 90420 

tctctcattc accattttga ccccactgtg ctcctgggcc actctccaag gccccgcctc 90480 

ttcatctcct ccctccttac tcatccttca ggtcttggct taggtgccat tccctccagg 90540 

aagccttccc tgacaccaat cccatcctca cctagaacag attatgtgcg cttctttgtg 90600 

ccccccatgg ccccctgtgg gtttgcttca cggattataa ctgcctgact acctgccttt 90660 

ctccaccctc tagactgaga acaccttgag aaaaagaaca catctatctt gtctgtcatt 90720 

gaatccctgg tgtctggcac catggctgac acataactat cactcagtga ctagtgtttt 90780 

aatgaatgaa tgagtgcaac tagacagggt taagaacaaa- agagaagacc aggcatggtg 90840 

cttcacgcct gttatcccag cattttggga ggctgaggcg ggcagatcac ccgaggtcag 90900 

gagttcaaga ccagcctgac atggtgaaac cccgtctcta' ctaaaaagac aaaaattagt 90960 

gggggatggt ggcacacgcc tgtaatccca gctacttggg aggctgaggc aggagaatct 91020 

cttgaaccca ggaggtgcag gctgcaatga tctgagatca caccactgca ctccagcctg 91080 

ggcaacagag taagactcta tcacaaaaaa aaaaaaaaaa aaaaaaaaaa gagcgagaga 91140 

agatgtcatg gggtaaatga agacctccct tcctggttcc ctgaccagcc cctgccctcc 91200 

cccgcatctc acctgtcttt cttgcttcct tctggtactt ctgtttaagc cggtccatga 91260 

gcaggccatt ccagggggca cacagcactc cgaactgagt gaaggcaaag gcatttgtgt 91320 

aggtgctgac tgcagaagga agagagaggt tggttgatga gaagtttcca aaactccctt 91380 

ccaggcaggg actctcccac cttacccttg tctgcatgtc cctcctcccc acaccatcag 91440 

accctcctct ggtgtgtaca gccctgctgg gaggctctgt gttcccagct gggacatgca 91500 

gatgggctac ctcccagccc taccacatac ctcgtgccat gtccccaccg gccatgttgg 91560 

tcagcaagga gttgagagtg ccaatgaaga ggtagtgcca caactgtatc acagacagcc 91620 

acaccaggtg ccaggcaaag cgccgagaga aagcgtagct ccagaaggag cggagttcct 91680 

gcttctgccc tgcccctggg gtctctaatg gggagaggag gatctgggcg tgaattacga 91740 

ggaaagtgga caggtaggat ggggagtgtg gaggcttcaa tggaacattt cagatccggg 91800 

cccaccttct acccttggct cccagaactc acccggcgtg ctctgcacct gcctcggact 91860 

cacacatccc cacctccctg ctttgtcatg ctggccctac caccttggat gaccctctgt 91920 

gttcttctct attgaaatcc gatctgtctc tcacagcctg gtcaatgaca cttcttgcag 91980 

taataccttc ctgatctttc tcagcgagaa aggtgaaagg aacgacaagg agaggagaaa 92040 

gtcagaaggg agaggagaat gagtgtggat actctgttct aacctgctcc tcagcacctc 92100 

cctttctttt gataccagta tcctgagttt ctttgggaaa tcttcctcta ccctaatctt 92160 

catggtccag atgggaccat gaattcagtg ttctgttcct ctctcaaggt taaccaatga 92220 

gatggttcct cctaacaagg cagaccggcc atgagtttag attaggatgg acttaatcta 92280 

aaataggtcc tacaccctgg caagttcaat gtcctccctt gattttggaa gcttcccaga 92340 

accctattct tctttcttta aaataaataa ataaatacat gttttggatc caattgtcag 92400 

atggtaaaaa taaaaacaaa aaaatcaatt ttattctgta tatttaagat atacaatatg 92460 

aggtcatagg atacatatag ctactaagat ggttactaca gttaagcaaa ttaacatatc 92520 

catcatctca catttctacc tgttttgtga caagagcagt taaaatctac ttgtttagga 92580 

aagtcccaaa cacaatgcag tttgatgacc tacagtcttc gtgctgtgca ttagatctct 92640 

aggcctgttc atcctgctca tctgctcctt tttgtccttc gacctgcatc tcccatctcc 92700 

tctcccaccc cgttcttatt tctactgtag ctagctgcgg tttgtgatgt gtgtaaccaa 92760 

agacgcagaa cagagaggaa ggaaaggaag cagtgataga gttgggacaa taagagaggg 92820 

cggacccagg agacctggag aaatgggggc actgtaccag acttagtgca atggcatcac 92880 

agaagagggc agaaccgagg agtgggggga agggaaggca acccatggca ggcgggcttc 92940 

aaggggtggg gaagtgatag gatgcgaaat agagaaaaga gggacagaaa agagacgaaa 93000 

gccctggacc ctccattaag tgagagggtt gggaagatgc ctaaggccct ttttctgtcc 93060 
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tgcctttcct 
aaggagaggc 
tgaaaggaac 
ctttgtggtg 
aggactgtga 
ccaccgcact 
ttaaatcaga 
ctagtgattc 
ttcaaatccc 
gagcctcatt 
gatgaaatca 
ttcctctact 
atttccccca 
ccttgtgaac 
aaatctcaag 
atgagcccac 
ccaaacctat 
atcagatgta 
agccagctac 
tttgagatag 
tcctgtgata 
cccacaatca 
gtgcccccgg 
gaagatgaag 
gaaaaagtca 
aagcaaacta 
atggaatatt 
tcccagcact 
catagtgaga 
catacacctg 
gtttgaggct 
gaccttgcct 
catggatgaa 
tagtgcttta 
gggaaacagc 
gtgcacgcct 
tacatgcctg 
agttccagac 
agtggggcat 
tcgcttgaac 
ctgggcaaca 
tgttatgtat 
gttttctaag 
cctggtacac 
acgacccatc 
cagtgccttt 
aaattcatct 
cttggtgact 
accaccgacc 
gtgcagtgaa 
gaattcctgt 
catggtggct 
aggtcaagag 
caaaaataag 
ggagaattgc 
tccagcctgg 
aggaaagaaa 
gtgactctta 
ccctccaaac 
agagatcctc 
ctaagcccag 
cactcagcca 
catggttgcc 
ttgtttttac 



gattctgggt 
ttgggtctgg 
tcctttgact 
ccattcccag 
gccgaacctg 
ttggaatcac 
atgtctggag 
taataagcag 
agctctgcct 
ttttgcattt 
gagagcacat 
ccttgtggca 
atttgaatga 
tgcttcagcc 
aacgcttgtg 
gctggcttgc 
gtgagtgagg 
tgaatgagcc 
ccacagactt 
tttgttatgc 
ggtttgcttt 
ccagatagtc 
ggcatcagga 
gaggccctga 
gtgtcacacc 
agggtccatc 
attcagcctt 
ttgggacgca 
ccctcacccc 
tagtgccagc 
gcagtgagtc 
ccaaataaaa 
ctttaagagc 
gaagaccaag 
aagacctcat 
gtaattccag 
taatcccagt 
cagcctggcc 
ggtggcacgc 
ccgggaggcg 
gagcgagact 
attttattgc 
gagtcctagt 
acatgccccc 
cccacagccg 
tttcccccaa 
tacaaaacac 
ggtatgacat 
tagcccaacc 
ttgctcaata 
tcctgctccc 
catgcctgta 
ttcaagacca 
ctgactgtgg 
ttaaacccac 
gagacaaagt 
gatgaggcag 
tgccaggagt 
ctccaaagct 
ttctctgccc 
agagaagtcc 
tctgtggtgt 
acaactccag 
tctgctttca 



ccctggggga 
aggagagatg 
gtagctccct 
ggcacaggct 
agactcagac 
cttttagcca 
gtgagagcca 
caaagtttag 
cttcctagca 
agaaaatgga 
ggggtctgac 
aatgactgca 
gactgcagct 
aaaagaatgc 
catttctgca 
tggaggatga 
ccatcctagc 
ctgtcaaaat 
gtgaaaaata 
agcaatagct 
accctgtcca 
cttacccata 
ggaaagtgcg 
ggctgatgcc 
cacgttcata 
agcagatgaa 
aaaaaaggaa 
cgaggatcac 
ttctctagaa 
tactcaggag 
atgactgggc 
aaaaaaaagg 
actatgcagg 
acaggaggat 
ctctactaaa 
ctacttggga 
actttgggag 
aacatggtga 
acctgtaatc 
gaggttgcag 
ctgtctcaaa 
atacaaaaac 
acaagccatc 
acccacccca 
gtggagcctc 
cgacagcaaa 
accaatgggg 
caaagagaat 
tcctcccatt 
aattgacaga 
tttccccttc 
atttcagcac 
gcctggccaa 
tggcacgcct 
taggcagaga 
gagactccaa 
ccatctgggt 
agaaaggctt 
tatgggacta 
ccataacctg 
caggttggat 
gctggcttta 
ctatcacatc 
ccttgcgtca 



gctctggagg 
ggttagccag 
gttttcatgc 
atggaaacaa 
tggagggaat 
ctctgatgcc 
ggcttccttt 
gaagcatgaa 
acagaaagat 
gataaggata 
aattagtaag 
aaaatgatca 
ctatttcccc 
agcagaagtg 
ctctttcaga 
cagccacgtg 
atagccagcc 
catctacatc 
ataaatgctt 
aacagatgca 
tcccacccta 
gctgtagttg 
tgctacatgc 
tttttcataa 
gcagcactat 
cagctaaaca 
agaggcaacc 
ctgagcccag 
aatttttatt 
gctgagtggg 
cactgcaccc 
aaaggaaaga 
gccaagctca 
tgcttgagtc 
aataaataaa 
ggctgaggtg 
gctgaggcaa 
aaccccgtct 
ccagctactc 
tgagccaaga 
aaaaaaaaaa 
atcagcagaa 
acctactatc 
agacacaccc 
ctcgtgtggc 
ggccttttgt 
agcttgccag 
ccctaagttc 
ttacagagag 
gtaagcagca 
caactctaga 
tttgggaggt 
catggcgaaa 
atagtcccag 
ttgcagtgag 
ctcaataata 
tctccagggg 
gagtgcctca 
agctagactc 
caggcagtgc 
ggcttgagat 
gtcctccagc 
ctcacaccac 
gggaagaaaa 



tgaggggcca 
cagggctcac 
tcagctgttt 
aagccccacc 
agcatggtga 
caggttgcag 
tctaagatct 
aagagtaggg 
ggctcagact 
tctcagagga 
tgagcagcaa 
catttcttca 
atgccctgaa 
gctgtgccaa 
accctgaaat 
acccaggcat 
cccatgtaat 
tggccctgat 
aacattttag 
ctgctccagt 
gggccaatga 
gggggcagtg 
caggtactgc 
agaagctgca 
tcacaatagc 
taatgtgatc 
atgctggctc 
ttcaagacca 
taattagctg 
aggatttctt 
cagcctggac 
aattctgaca 
gtggttcctg 
caggagcttg 
taaatcagct 
agaggatatg 
gcagatcacc 
ctactgaaaa 
gggtggttaa 
tcctgccacc 
ggttaagata 
gaggcagggg 
ctgtaagctg 
ggcagtagag 
tccccagaga 
tcaaagaaaa 
tcatctccct 
ctcaacagct 
tgacgttgag 
gcaaagtcag 
gagacaggag 
caaggaaggc 
ccccatctct 
ctactcagga 
ccaagatccc 
aaaaaaaaaa 
aaggagggag 
ggggctcagt 
atgtctgggt 
caactgcctg 
ccacactctt 
ttgctgcctc 
aacattcaat 
gtccccttga 



ggaaaggcac 
cttccttcgc 
ccttctcttc 
agcaaggcca 
atcccacatt 
accagaccag 
ctatgtgaat 
caggcccagg 
taacccttct 
ttattgtgag 
aggaatgccc 
cctcctctgt 
tctgggccag 
ttccaagctt 
cacggtgtga 
ccctgtcact 
cctccaaatg 
cagcggaact 
gctgctgagt 
cctcctcctc 
ggggctctgg 
ggtatgggat 
agacagagat 
gaaggagaag 
caaaggatgg 
tatacacaca 
acacctacaa 
gccttgacaa 
ggtgtggtgg 
gagcccagaa 
aatgaaacat 
catgctgcaa 
cctgtaattc 
agaccagcct 
gggcgtgatg 
attacatgat 
tgaggccagg 
tacaaaaatt 
ggcaggagaa 
gcactccagc 
gtaaatttta 
ctggaaccct 
attagggaca 
gagtcctcat 
tcttctagcc 
ttttacacaa 
ctttattctc 
cagtttgaaa 
gtccagagag 
attaaactaa 
agaggctggg 
ggattacttg 
actaaaaata 
ggctgaggca 
accaatgcac 
aaaaagagag 
aacccagaaa 
ctctgcataa 
ggtgactgcc 
tgacctaaca 
cccttccttt 
ataattgaag 
gaggaagact 
atcttccact 



93120 
93180 
93240 
93300 
93360 
93420 
93480 
93540 
93600 
93660 
93720 
93780 
93840 
93900 
93960 
94020 
94080 
94140 
94200 
94260 
94320 
94380 
94440 
94500 
94560 
94620 
94680 
94740 
94800 
94860 
94920 
94980 
95040 
95100 
95160 
95220 
95280 
95340 
95400 
95460 
95520 
95580 
95640 
95700 
95760 
95820 
95880 
95940 
96000 
96060 
96120 
96180 
96240 
96300 
96360 
96420 
96480 
96540 
96600 
96660 
96720 
96780 
96840 
96900 
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atatacactc 
ctagcaacgg 
gctaagcggg 
attgacctct 
gacagctgga 
ttgtccaaac 
cccaacactg 
aggggcttag 
tatcatggct 
ttcatctgta 
tgcaggttca 
tatatttacc 
agtgatgatg 
gcctggatca 
ccctccttat 
cccatctgag 
ctgccgagtg 
ttcccccaat 
caaagtcaga 
cccacctgct 
cagcgtcctt 
gtcttctgag 
gtctacagac 
aagctgcctc 
ctggctcaac 
tgtttaatgg 
gaaaaggacc 
attgtgcttg 
acatgcaaac 
agagctgggc 
cattattctg 
cacttttatc 
aacactggca 
aaacaatgtt 
cctgctcttc 
aatcagaagg 
tttgttagtg 
ataccttaag 
tttctgtctt 
atagagtcct 
agaggaggat 
ctccacattc 
ataaggagag 
tgaacaaaga 
tgtcagcttt 
attgagccac 
ccaaggcaat 
ccacttgctt 
tttctcctct 
cttaacagac 
taccctgcta 
agaaagagaa 
tttttagggt 
caatggtgtg 
ctcagcctcc 
tttttttagt 
cagatgatcc 
cctggccaga 
ccttgctttt 
ttccaggatt 
atagccctct 
cccacctgac 
aaaaatctga 
gaaaaaacat 



cctttatctc 
gtaaatggat 
tcactgccca 
aatagccttg 
caagcctctg 
tagaagctca 
gcccaagcac 
cggttacaag 
actagctgca 
aaatgggggt 
caagacaagt 
ttaataataa 
gtcgaacggt 
cttatttatt 
acccaaactc 
gcagccaatt 
cctgggatta 
ggtgagcatt 
ggtagggtgg 
ccaaattccc 
caagggtttt 
ccaccttctt 
agaggcagag 
tccctttcat 
cctctagccc 
gaacaacagt 
atggtctttt 
gcactggaaa 
aaatggttat 
taggggaacc 
gccacaacct 
caaaaagtat 
ttacttgtca 
attcaattgt 
ccatcagtgt 
actgaaagag 
tagttcacgt 
aagcactgcc 
tataaaatgt 
tatgggctca 
tgttgctcag 
agttgcttaa 
tggtttgaaa 
cactcagcct 
agcatagctt 
gggtctaaca 
gacagaaaga 
tctctgtggc 
gtgtaatggg 
tggtgaaatt 
agtaaggata 
cattttaaca 
tttttgtttg 
atcttggctc 
tgagtggctg 
agagaaggga 
acccgcctca 
tgtaaagttt 
cacattttct 
ctttcaagct 
gctcacacct 
agcactcacc 
aacacataac 
gattctgctc 



attaaaaagg 
tgccatggtt 
gataaaactg 
gcaacagttc 
tgtctgcccc 
cagcagcaaa 
attatgacca 
ctcaggctct 
gaaccttcaa 
aacacagtac 
gtctggcata 
ggaagactgc 
gttggccaaa 
catgaaacag 
accctccacc 
atagaaaagg 
gggaaggctc 
ggcatggcca 
tgtcatagtg 
aacaatcctg 
taggataccg 
cccaaccaaa 
ctctaagtga 
ccttgacact 
attccctaat 
tcctgctctt 
ctttgttcac 
tgcaagatga 
aacaggtctg 
cttgacaaat 
atgcctgtcc 
ctatttttaa 
tgaatagaaa 
agctggatgc 
taaaatcaca 
aaataaaaag 
agtatgcgtg 
aagagctaat 
acatctctcc 
atgctttttt 
acgatttgca 
aaatcattta 
taattggttt 
ctgtgtttgc 
acacacacaa 
ctaggaagga 
agagtgaggg 
catggacatg 
tggaagggtg 
ttaaaacaaa 
tttgcaaaat 
cataggagaa 
ttttttggta 
actgcaacct 
ggattacagg 
tttcaccatg 
gcctcccaaa 
tgaataaatt 
caataaactg 
gcagatctct 
gccccagcca 
tgcagaggtg 
aggaaaagca 
agccagccca 



actggatcat 

ggctttaatc 

ttcgcaatga 

atcccctgat 

tgctgtaccc 

cgccccactc 

ctgccacaaa 

agaaccgaca 

caagcttttt 

ctacctcacc 

tacaagtgcc 

cgaggaagag 

taggttccca 

atacttcctg 

gctagagctc 

gtctctcctt 

ccacctgcag 

ggaagagcag 

caacccaaac 

gtatccaggc 

gccaaggctt 

gttggtcctc 

agccacctct 

cctggaaaag 

tcatggtatt 

cctgtctcag 

taaactctga* 

atcagatagt 

gtaagtgtga 

tcttcctccc 

tattatttgc 

aggacacttt 

gtaactgtca 

atttgacctt 

ctggccccac 

ggaatagctg 

tgcccctaac 

tttagagtat 

tactactaaa 

caaaactgaa 

ggcacgagtc 

caggctttta 

gaggtgctgg 

ccagcatgag 

gagtaatgta 

atattgggag 

atcaggaggc 

tctttgcaag 

gtggaaaata 

gattttgagt 

ggtcattcat 

ggatgtaaag 

gagtctcact 

ctgcctcctg 

cgcgcaccac 

ttggccaggc 

gtgctgggat 

ctactctctg 

ttttcacaga 

gaataaccat 

ccaggtgcct 

aaggctatga 

gaatattgtc 

caagtgtagg 



atgctgacct 
aatcaggatt 
ataagacaga 
accccaacat 
actagccact 
taaaggtccc 
agcttgggca 
agcctgggtt 
aacctcagag 
gagttgatgg 
cagtgaatgt 
tcaaatgctc 
atctggggat 
agcacccagc 
ccacctcagc 
ccctccacct 
gttggtgatg 
cacggctgag 
atggagcccc 
cccaattcta 
ccccagatat 
agtctgtggc 
ctcttccctc 
aagaccctgg 
ggccttgagc 
gtgctatgag 
gcacttcttt 
ccttgccctt 
gaccacagca 
caaaccagac 
taaaatgttt 
atatcactac 
aaataaatac 
agaatgttca 
atgaagacat 
ttctaccagg 
atcctcttaa 
tcacacagtt 
ggttggagac 
aagccctata 
aaacattacc 
gagtagatga 
gccatctcat 
tgcagacaat 
ctttctttcc 
gcatctcgtg 
ctgcacatca 
gggtcctgct 
aaccagattg 
acaatagggt 
ataatcattt 
gttttttgtt 
ttgtcaccca 
ggttcaggca 
catccagcta 
tggtcttgaa 
tacaggagtg 
aagtaatccc 
ccagcaatag 
gtggtctgta 
ctgagcttgc 
tgagtgtggc 
aaggagggag 
acttgaccgc 



ccacctatca 

catcccctgg 

atggttgttg 

cagccactgg 

tgccaccttc 

ccagcctcta 

agtctgaaga 

caagttccag 

actcaaatgc 

agacaaataa 

aggctgttgc 

cattgtacag 

gataggacta 

atgtggcaga 

ttgggccaac 

tcccgccacc 

agaaacagga 

cctggaccat . 

aaactctgct 

gccagcgttc 

ctctgtggaa 

aggccaggaa 

agtaaaccac 

actcaggtcc 

ttcaatcatc 

aactgagtga 

ggtgccaggc 

aaatagactg 

aaaaagctca 

ttctgcccac 

taagttgact 

tgtagatgaa 

aatgaaagga 

aagcctaaga 

tctttcatga 

tgatttgatg 

ctaccgtgct 

taccattcaa 

tcctttcaca 

ttatggagga 

cagccaccac 

tgctggtttg 

gagatctgtg 

ctcatgatgc 

taaaccaaaa 

gccaccataa 

ggcccacctc 

gtggcttcag 

gagttccaga 

tgtcaacttt 

cattaaaaag 

gttgttttgt 

ggctggagtg 

attctcctgc 

atttttgtat 

ctcctgacct 

agccactgca 

tctccatcat 

ctcaagatcc 

tatcttgcct 

atccctccca 

ggtggtgtag 

aaacctggga 

accctcagcc 



96960 
97020 
97080 
97140 
97200 
97260 
97320 
97380 
97440 
97500 
97560 
97620 
97680 
97740 
97800 
97860 
97920 
97980 
98040 
98100 
98160 
98220 
98280 
98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
98880 
98940 
99000 
99060 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
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tgggatgcaa 

gcgatgagat 

ccaaggctag 

acttagccag 

aaccccttgg 

ctcctcactc 

tataaaaaga 

ccaaggttat 

aaagcaatat 

gtagcaggga 

aaattgtaag 

cccgtctctc 

ctgtatcagc 

aggatgcagc 

ggaaaccccc 

ccaaagaaac 

gggggcggtg 

gatgaggaca 

ttgtaaccct 

agcacgtgat 

ggggctgtgg 

agctgctggg 

ggatttcact 

ctggcctttg 

atgtagccag 

gagaacctct 

agccttcctg 

tcgcaaactt 

ctggtgcctc 

tctcatcata 

tgggagccag 

gcattgccaa 

aagacaaaca 

agcccagtca 

ggagtggatc 

gcgtttgagc 

tggttgtttc 

aagatagagg 

tgtgaggtgc 

tactccaata 

ggtcttgatg 

gagcaggtgg 

gacagcaact 

ctgccaagtt 

aggaacttgc 

aaataccccc 

tcctcgcatc 

cccgctccag 

ggcgccccat 

tctccgcgcc 

tcccctgctc 

tctccatcac 

ccctcttgaa 

cctcgctgcc 

aatgctgtta 

cacagacagc 

cgctgggttg 

gtttcacact 

aggactgagg 

aaacccctta 

gccagactgc 

attgctcatc 

tccagcccag 

aagctcagtt 



cgggcactga 
tgagatgtgg 
catggaaagt 
gacagaatgg 
agaaaggggt 
tgtgaccttg 
aatgatcgga 
caggctccct 
cctaaggctg 
gccaaggacc 
ggactggctt 
tctctgacag 
ccccagcagg 
aacacactgt 
ccagtctccc 
ttcctattag 
aggaggatga 
aggatccttg 
ggctccaagc 
agggaagcca 
tgaagtgggc 
agcgagctgc 
ccaagctagg 
ggacttacat 
tggggaatgt 
catcctgggc 
ccaaatgagc 
gtcccttaag 
cctcccaggc 
gaaagagggg 
acttaacttc 
tcggcccagc 
ctagtgaagg 
gcagtgtggc 
ttcaaatccc 
acttggaaaa 
aggcctgggc 
ctgctggaag 
attaagaggc 
catcatctta 
gccacctgga 
agaagcggag 
caaccaggtg 
ggtgttatct 
ccaaggtcac 
aactcccatg 
aacttctacc 
cctgcggcct 
ggcgcccgaa 
ctttccgcac 
gcctccgccc 
ttccttcttc 
acttctccct 
tttcagcctc 
ctcgctcagg 
acatgagggc 
gcagggaaca 
cccagagctg 
gcttttcaga 
gatgactccc 
attgattaac 
ggagcttgtg 
agcacgtgag 

tggggagagc 



tgcctctgag 
tcctgacctc 
cttggacaat 
gcaaagcgaa 
tagtgcctgg 
ggcaaggcgc 
ctgatctagc 
tagcatttgg 
gagaagaggg 
aagaaggact 
tactctggct 
ccaatctcac 
ccgctgatta 
gtgtgagatc 
ctccagtcag 
ctaagcccct 
ggatgaagcc 
gggtgaagga 
cagcccttac 
catttatgca 
aagtctagag 
aggtgtggtg 
ctgctgcctg 
ggctatgagg 
catgaagttg 
tttgcagtct 
acacaggctg 
ctgagttgaa 
acttcccaga 
tgggcagggg 
cttagaaaag 
atctggtcca 
ccagccaaag 
cacgtgcagg 
actttgtcct 
ttcctctggc 
aaaaaccatc 
aggaggcctg 
aggatgatca 
ggcactgcat 
gtcacaggtg 
ctcctgcgct 
cagaggcagg 
tcagtttact 
agggctagtt 
accctcgcct 
aggaaagcct 
gccctccccg 
aggggaccct 
gggccaggtt 
cctcctgccc 
ttttcccttg 
ctagaacccc 
ccagccccct 
cccacctagg 
atagggacac 
tatttttcct 
ccaaaaaaat 
aaccatcccc 
tgcctcaccc 
cattcattgc 
atcagcgtga 
ctgacgtctt 
ttcttgcgta 



ccccaggctc 

atggacagtg 

gagctcagct 

ggtcctccca 

tctgcaaatc 

tgtccttctc 

tcagagtgct 

gggtcttgta 

aggggacaaa 

gaggtacagt 

gtttccggaa 

atgtgcctcc 

ccactgagcc 

acgtcccgaa 

aagggacctg 

agggagtgat 

tgggcaacct 

agagaagagc 

aggaagtcac 

gagcagggaa 

agagtcccgc 

cctggcaggg 

aaggattcct 

cgtgccacgg 

ttcatgaagg 

ggagtagaaa 

ggctcccctc* 

atgtggctgc 

tcaagtgggg' 

gcagagtcct 

tcatccctgc 

cacagatcct 

aggacgccag 

ggcaggccct 

cctggacgga 

aagccaagcc 

agcgggtgat 

cgggaaaggg 

tggccgctgg 

ttgagtaacc 

agataaacga 

cgaattctga 

gagatttcat 

gacaatgaaa 

tagtcacgac 

aggactcgca 

ccgggggccg 

cagaggagcc 

tcgccctacc 

cgcattcgcg 

gcccggaagg 

ctcacagcct 

ctagaacccc 

tgcctctgcc 

gaaaatgtca 

acacactcta 

atttgctcac 

cccaaccaca 

tggaggactg 

ccgcccccca 

cccatttttt 

ggccttacta 

ctttggcctg 

ttagatgcag 



aaaaccaggc 
cattttgctc 
gacgatgtga 
cctggaagcc 
aaggccttga 
tgggcctcag 
atgatttcag 
aggcatggag 
ggaaggggag 
cattctgcat 
aggcaggccc 
ctgggagcac 
tggccacaga 
ccttctgact 
aaattccacc 
tggctgttgg 
ggatgtgagg 
aattttaggt 
cctggcctcc 
cgaggtaagg 
tgcctggggc 
tggccgggct 
cttacccacc 
tggtcttgaa 
accccagggt 
aaaggtctcc 
cacctcagac 
ccctaattac 
tgagagctgc 
tcctgctcct 
ccttaccagc 
taaagtaatc 
caaagcccag 
ggcccgccat 
tcacaggcgc 
cttcctttcc 
tctctggatc 
aaaggtagac 
cagcaaatgt 
acgtggcaag 
cttacccaat 
ataccgtccc 
acagaagaca 
caaaagctcc 
gcaggccatt 
aacctggtcc 
ctccccgcca 
cgaggggcca 
cgcctgctcc 
cctctcgcag 
ggctggggca 
cccgcgccct 
agcggtgtct 
tcccctaacc 
cacccagcac 
tttgtgcatt 
cagcttaacc 
gaatcaggaa 
ccccatattt 
ggttctgaaa 
attaatcaaa 
agcagctgcc 
tgtggccgtt 
tctgcagact 



gcaagaagcc 
attctgaggc 
ttggcttggg 
ccaacagccc 
gttctaactc 
gagccttttc 
gactacagtc 
taaaaaaaaa 
gaaaggggag 
ccaaaggctt 
agccagccct 
ctgctctgag 
gcacgagatt 
catctgcaca 
agtggcaata 
ggcggggagg 
ctgtgcaggg 
tttgctaatt 
ggctcaattc 
aaatggaagt 
tgttcctaac 
gtctgactct 
tttgcctggg 
ccggtcaaag 
gaagatgagt 
catgcatccc 
agcttgtcgg 
ccctcaggag 
tgacccttcc 
tgccaccacg 
ctgccctgtg 
ttcattcttg 
gcattccagc 
gagcagaagt 
cgtaagcctg 
cgtagctctc 
ctgtagaata 
tagagttatt 
ggggaataaa 
tagagaggca 
agctccccgg 
ctaaaataat 
caaactcccg 
catggatttc 
ctactgccag 
ccgccgccct 
gcctccgcac 
ggccgcgctc 
gcgccggggc 
cccctcccag 
gacctcccac 
ttttacctct 
ttccctccct 
aagttagttg 
ccagaggaca 
ttgccttgac 
gtctctccca 
gccaagaacc 
tcactcccaa 
gagccttccc 
gacatatata 
ttactatcct 
tccttgccaa 
cccaacccca 
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gctacctgga 
tttaagagga 
caccaccaaa 
gaagggtggc 
tctttggctg 
aaaagaagta 
gtagggtagt 
ggaggaattt 
tctgaaacgt 
gtcttttctt 
acaaacaggg 
ttaatgcatg 
gtgtggaagt 
gttaatctgc 
tgaagcacgg 
cttgactctt 
cactaacaca 
cccatgttct 
tcctgcgtgg 
ggtctgacgg 
acaccttttt 
tgtgccaagt 
ctcttgtggg 
aggcagatgc 
ggaaccagct 
tctaagtgag 
cccttaatat 
cacagaggca 
gttcccactg 
tggtaaggac 
agggagcata 
tttccttccc 
gacagagtgg 
tataagtgag 
ctcaagagca 
tcctctgaac 
tcctaccctc 
ataaagggct 
tcacaatggg 
ttctacagcc 
cagagaaaat 
agccaattac 
tatggattaa 
tactgttgtt 
tagggagaat 
aagtaatggg 
tagatgacac 
gcttggattg 
gatggagtct 
gagggtgggg 
tttctcttta 
cactctggcc 
cagcctggat 
actggaagca 
cttatgaggc 
caactgttta 
cactgacaat 
ggaggcacgt 
tgagcactga 
cgataggtgt 
aactccccca 
ctgaaactaa 
tcaccaaggc 
tcaagtgatt 



tcccctgagg 
agaaatagag 
gcacaaaaga 
tggggtcctg 
ccccacctgt 
taagatgggt 
agattttcta 
caaaggaact 
atgctctcat 
aaaataacat 
cttaagatgc 
ccctatcttc 
agcaagggag 
aggagcctgc 
gacacggggc 
agagcctcca 
agtcttactg 
ttggttcgaa 
ctccaatgag 
tgcacccgat 
ttttttaatt 
gccgggattc 
agagacagac 
tagcacagca 
agagcacagg 
ggcattaaga 
ctctcctatc 
tcatctgctt 
gatggcttca 
cagtccaggg 
gaactggatc 
atcatcagca 
gaagatggat 
ctgcactcca 
ccaccttcca 
aacctcagcg 
tgacatcact 
ttgatttcgg 
ctcatgtaaa 
tgttttggaa 
tctaatcctc 
tggtatctct 
agataatgta 
aacaaatatt 
ttcaaagtgt 
tgagctcagt 
ccaaatgcac 
tcctgtaagc 
tagagctggc 
aactactaca 
acagaggtga 
cctcagagca 
cagcctccct 
ctgggccagg 
tactgctacc 
ctggtaactg 
atctaaacct 
cctacccctg 
ctatgtatct 
tgttattagc 
acatcaccag 
tgcttttttt 
tggagtgaaa 
ctcctgcctc 



gcccaggaac 
gttacgatag 
taaaagtctg 
taccccaagc 
aagacgagga 
gtatgaggtc 
gtctgcagtt 
tgcccacggt 
caccccgtct 
tatttatggg 
aagattttct 
tgctctggca 
aaacagaaaa 
cagaaagatc 
aggtcgtcca 
ctccacatct 
aactgatggg 
ttccagcagc 
gattaatgag 
gtcttcatcc 
gactgattgg 
gtagagaaga 
aaagaaaccc 
cattgaaagg 
gtggtgaaga 
attcagccca 
tatccgtggc 
ccaagccatc 
cttaactgct 
tctggggatg 
tcctgcatcc 
agtctggata 
atgtttaggt 
tcattcccct 
atgcagtccc 
agggcaattg 
ttctaaattt 
atacgccaaa 
ttaaaatgtt 
gtcagaaagc 
ttgaggcagt 
gagcttccgt 
tgaaaactgc 
attgacttac 
tcctaaccta 
ttaccttgct 
tcagtacaac 
tgtgggaagt 
tcttcagggt 
tagccaagta 
atctgaaagg 
gccagctcag 
ttcaccatgc 
cctaagagaa 
caccctccca 
cacttcccga 
gcaactctaa 
agaagaaaaa 
gattcgaagc 
atctattatc 
gtagcagtgt 
tttttttttt 
tggtgtgatc 
agcctcccga 



tccagctatt 
gggacagcca 
caaccaccct 
tactcactag 
caatagtacc 
cctgcatggc 
atgtagacag 
cattctacaa 
taacaaacat 
agaaaatcca 
attttactgc 
tacattttag 
aagtcaaagt 
tagctcatgg 
gggttctgtc 
cccatcaatg 
acaggaaatt 
tagaaaaggc 
taacatcaga 
ttttctcttc 
ttcaacaaat 
gattcagtgc 
aagatttctg 
cacggaacct 
aagtggaaag 
catcaatcaa 
cactgctcta 
gccattctcc 
caaaaccctt 
ccagccatga 
tactgcccaa 
tccaggatcc 
tagggaaaga 
ggcacaaaca 
tgcctgtcca 
ccactctctg 
gtatatgtag 
cacataaaca 
tggttttcca 
aaaaggtaaa 
gccaaaaata 
ctcctcatct 
ccaattagta 
tggaaaacaa 
aaaaagaaat 
taagaagtcc 
cccccaagac 
ctgttcatga 
atctcctagt 
aatatgaggc 
atacccaaag 
cttcagtgga 
agaaaacaga 
agcagagccc 
agccccagct 
tagactttgc 
ggactaggct 
taaggaatat 
gctttttgct 
tgggccaact 
caggactggg 
tttttttttt 
tcagttcact 
gtagctggga 



ccaagcccac 
gaactgagga 
agtgacttga 
ttatacaacc 
ttaattatag 
gcaggtgcta 
agccagagaa 
agctgcagta 
ttggacatta 
caaaaatata 
aagacacaaa 
tctcctgggg 
aaagagacag 
gctatctgta 
caccttatct 
tctgcagaag 
agaatatcct 
agatgctatt 
gagagaagtg 
gcctccttcc 
acatgtggta 
ctgctctcaa 
gagtgtggga 
caacaaaaca 
atttgaggcc 
tcatgtccta 
tgcagacact 
tgcaagagtt 
ctgaggtcca 
gacattgctt 
gtaccaatgc 
accctatgga 
gggtttgcca 
atggctagta 
cagacctctc 
ggcagagtcc 
ataaactctg 
aacaaacata 
cctacggtct 
tgcaaacatc 
atacaagcac 
ataaaattgg 
cactattaat 
agaggaaata 
aaattagggg 
caccctagag 
tgtctgggct 
gccacagtag 
gtgttcaaag 
ctccttgctc 
aggcactgga 
tgctagaggc 
gctcccccac 
caagccccac 
ccacttctat 
tagaaaggaa 
ggggaacact 
caataatacc 
taatctgtca 
gaggcctaga 
atagaaacct 
tgagacagca 
gcagcctcca 
ttacaggcgt 



tcctcttttt 
ttttccagct 
ctgaatggag 
tgaggcaagc 
gaattgtcat 
taggcagatt 
gcagctctgg 
ccttcccaac 
gagaaaacaa 
gcatcccagg 
gactctgaaa 
ggatcagtaa 
attttagaat 
catccaggac 
tgttacctct 
acgtggcctc 
ctgaaccatt 
ctgatcactc 
attataataa 
tcatcatctc 
cctcaggctc 
ggggctcatt 
atggtcttcc 
ataacattta 
agcgtcgcca 
ttgatttcac 
catcatctct 
tatttccatg 
gtcaactggc 
tgaggggaag 
tggaggtggt 
tgtttttatg 
aagagggcag 
tcctctagtc 
tcctcaaact 
agatattctc 
agccattcac 
agctttcctt 
tgaaaggggt 
atttcacctg 
actgctatcg 
aatcgagctg 
aaatagcagc 
aagtcacatt 
gaaaaccact 
aactgatctc 
taaggcaggg 
acaggaaggg 
cagttctcag 
tggggagacc 
gggtgggggc 

agcagaggat 
cagaccagat 
cacaccaggg 
gttcatcaag 
tgcctcagtg 
gtgtcaacat 
tgctgggcac 
ctgaatctca 
gggacgaagc 
gatgctctga 
tctcactctg 
cttcccaggt 
gcaccaccat 
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gcccagctaa 
gtctcaaact 
caggcgtgag 
cctgggaggc 
accagttatt 
aacaagagag 
accacaccca 
actcctgagt 
tgaaaagagc 
caaagggaag 
tagggattag 
gagagcacat 
caagtctgtg 
ggaagtggga 
gcacctttct 
tagccctgac 
gtcgtggggt 
atggtgagtc 
aaaccacgga 
tgggctgcaa 
aaagcattct 
acaaaattgc 
ttctcaatga 
tgagtgctgc 
tgatgcccac 
tgaagaaact 
ggaagagata 
cttttcctgt 
acagaagtcc 
gacatctctg 
ttgttgtgat 
gataaacctg 
gaatcctaag 
gaaatcaata 
ttgacacata 
atatccaagg 
gatcctatga 
ttccttgttt 
tgagacccac 
tcacttctgt 
aaaacatgga 
ctctgcaacc 
tagcacccac 
gaacagtgtc 
nnnnnnnnnn 
nnnnnnnnnc 
ggtctcaaac 
acaggcatga 
cgggaagtgc 
ggattctcca 
aggacagtct 
accagagtcc 
ttccagagat 
cccacaagct 
taacttggta 
gctcacatct 
gagttcgaaa 
ttagccaggt 
aattgcttga 
gggtaggcaa 
ctttattctt 
tgataatatt 
ctatccatct 
aatgagaatt 



tttttttgca 
cctgggctca 
ccaccatgcc 
ttcccaagag 
actaatagga 
tagtgctgcc 
cccacacgca 
gaaaacaacg 
aggaaactgc 
ccggaagaga 
ctggagggag 
gtaggatggg 
gtgaaattag 
gacccagggt 
ctcttggggc 
ttgctgaaat 
gctgcaatgt 
cgccctggga 
agttcccggg 
aaccgcaaga 
gccctccagg 
ctcaggggca 
aagcaccctt 
ttttgggaca 
agccacccca 
gaggcctcaa 
gacttcccag 
gccaatgtga 
tctctgtgtc 
ttatactact 
taagtatagg 
aaatgcaatt 
gctgactggt 
ccagaaggtt 
ggtgaagctt 
cagggagcca 
agatcttacc 
gcttagatta 
tgaagaggaa 
gttgtcactg 
cttggaagcc 
tcgggcagat 
ctcaacatct 
tgccatctgg 
nnnnnnnnnn 
taatttttgt 
tcccgacctc 
gccaccacgc 
ctgtggggct 
tgacctctgt 
gatggctggg 
ttccgataca 
atccatgcat 
tttgacatca 
tctctggtac 
gtaatcccag 
ccagcctggc 
gtggtggcag 
acctgggagg 
caaagcgaga 
tttgatggct 
taagttatgt 
gtaaatgtct 
tctgggtcaa 



ttttttgtag 
agtgttctgc 
cagccagctg 
ctacacagag 
atgaagatgg 
tctctgtgat 
tgatcatgaa 
gggtttttca 
tggggccgat 
ctggaacagt 
aaggagaagc 
aaccccagat 
ggaaggctcc 
ccagtcctgg 
ctccattcct 
gctgggactc 
tcctccagat 
gcctctgctt 
gctgatgggt 
tgggctgtga 
tcttaaagca 
tcagggaggc 
ttgggtatta 
ggcattgggc 
taaggaggcg 
gagatgaagt 
tatccagagc 
tctgcctcca 
ctcttcagcc 
ccaatggtgt 
ggaaatgacc 
tattaattaa 
cagcacaatc 
ctttgttgag 
gacatacctc 
tctgggtgtc 
tttctggctg 
atttctgcgt 
taatcaatac 
tatcctcatg 
agaccgtctg 
tacctaagtc 
gtcaagagga 
taagcagtcn 
nnnnnnnnnn 
atttttggta 
aggtgatcca 
caggccccac 
ggtgtggaca 
tactcatatt 
caaaccctgc 
tgattctggg 
gcatcggcat 
atagtagcat 
tgctcaatat 
cattttggga 
catcatggtg 
gcgcctgtaa 
cagatgttgc 
ctctgtctca 
gtaggtggat 
cttcagcatc 
ttttgaacat 
agaggatatg 



agatggggtt 
ctgccttggc 
atgctcttaa 
catgagttct 
gtaattcttt 
tctgtgtctc 
aagaggaaat 
cattgagagc 
tgaacactgg 
ttccatggtg 
tggggtggga 
gatactcaag 
actgcggact 
ctctggagcc 
acctctgtga 
tgtacagagg 
gctgcacggg 
ggaagctgaa 
tcttatagat 
ccactctcaa 
aacacagact 
agcaggcctc 
ataatgacaa 
taagagctat 
gaggtactat 
aacttggcca 
ccatgttttc 
ggaatcctgt 
tgatagtata 
tccctcccct 
cacactaaac 
cactgagaaa 
tctttcagga 
tacaaagtca 
tataaagcct 
ctctcctttg 
catttatcat 
atttaataga 
atactagttg 
cttccttata 
ggtttgaatc 
agtttcccct 
ttcaatgagg 
nnnnnnnnnn 
nnnnnnnnnn 
gagacagggt 
cccgcctcgg 
acacctttta 
tgtgggtgaa 
cccacactcc 
ggcaaaccat 
cagctgttgt 
atgtgtataa 
attattaaat 
aagaacatat 
ggctgagatg 
aaacccccat 
tcccagctac 
agtaagccga 
ataaaaaaaa 
gttctaaaat 
atatgaaact 
ttctagaata 
cattttacat 



tcgccatgtt 
cccacaaagt 
tctgtgccct 
ggaatcgggt 
cagacagcac 
cctgccctgc 
ggatccagga 
tttgcccaac 
acttttgttg 
ctggaggatg 
ggggaacctc 
gcatggcatt 
gtagacagag 
ccctgggtgg 
agcgagtgct 
ctgacattaa 
agagggcaga 
gtggcctgag 
tgtacatgca 
ggaaagagcc 
caatccttat 
aaatgtgtgc 
cagtaatgac 
atgtaatata 
cattatgcca 
agtcactcag 
accattatgc 
cttgatgttc 
tcttttcata 
acccctccct 
aaactcataa 
tgaaaccacc 
aggacaggct 
gagggaaggg 
ccatcctgcc 
gacagtgctg 
gattgtggaa 
actgaaaggc 
tgttgccctt 
atggagggac 
ctggctctgt 
tctctgaatt 
gaatacacat 
nnnnnnnnnn 
nnnnnnnnnn 
tttgccatgt 
cctcccaaag 
aacaaccaga 
ggtggcactg 
tcaaattagc 
tccccagccc 
tacccgtgtc 
ttattatatc 
tgttctgtac 
agacctggcc 
ggtggatcac 
ttctactaaa 
ttgggaggct 
gatcacgcca 
aaaaggaatg 
ttgtgtaacc 
tacaaacaag 
attgcaggat 
ttaatagata 



ggccagactg 
gctaggatta 
acccagcctt 
tgatgggggt 
ccttgattaa 
tcacacagac 
gaaggagacg 
accccaaaga 
tggaaaaagg 
gggaagtggg 
cacttgccag 
agaccagaag 
cactggacaa 
gctgccccgg 
gaacctctct 
gcagggatct 
aaaggcctat 
agtgactcag 
gctctcctcg 
ctatctgcaa 
tccttttaag 
ctttctagaa 
agtcatttac 
tattattatt 
actttaaaga 
ccagtaaatg 
tgaagtacct 
ccttccccat 
ccattctttg 
gggagcttag 
gagactgatt 
cagcagatgg 
tttgggaaag 
agttgatgga 
aaggatcaga 
gatttttctg 
ggctttttgt 
aatttcccat 
tgcagagaat 
agagatggta 
tacttataag 
ggggatataa 
aaagtgctca 
nnnnnnnnnn 
nnnnnnnnnn 
tggccaagct 
tgctgggatt 
tttcattcat 
ggagaagtta 
ctgagtctcg 
tgccctctca 
ctccatgttc 
tatatttcat 
attattttat 
aggcacagtg 
ttgaggtcag 
aatacaaaaa 
gaggcaggag 
ctgcactcca 
tatagacctt 
aatctcctat 
gttgcattga 
aaactcctaa 
tttgtcaaat 
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tgtcttccaa 
tttccccaca 
gaagctgttg 
agagaggtta 
cgatagcaga 
gaatggagga 
ccaccctgaa 
agtatttgga 
gaacattata 
atataatgta 
tttggcaggc 
catagcgaga 
atgcatgctt 
gagtttgagg 
aggaagacca 
aaaagcagag 
ctgttactta 
aacatataaa 
gagataatgt 
ggaaattcat 
ctttgatctg 
aggtataggg 
accaggcagg 
cccaaagggg 
tttatcgcgc 
gcttttacaa 
tttaaaatta 
tagcgataca 
tcagctcaaa 
gtaaaattag 
gataatgttt 
gaggatgatc 
caggctggct 
actgcacact 
agggtatatg 
acaagatttt 
ctaggaggac 
gtctcaaaaa 
acgtgctcac 
tcatcacagt 
tttgttttta 
tcttcaagta 
catattctcc 
tgcaaattct 
tttctgcata 
aactgcagtt 
gcctggagct 
ttagtcagtc 
tccagagttt 
agtgctgggt 
cacacggctt 
tgtttactct 
ctgttgaagt 
ttaattgctt 
aactacaccc 
gtcactggcc 
tcccccacta 
ccatttgccc 
agtggcttca 
cagaaatgag 
ggtgcctata 
cagagtctct 
tcacggccac 
ttacccactc 



agtggtcgta 
ccctggagag 
gccgttatat 
gttagtgggt 
ggagggtgac 
cttaaaatat 
tgtgccagat 
tgggagaaat 
cattctatat 
tcagtaaaga 
cagggtggga 
ctctgtctcc 
atagtcccag 
ctgcagtgag 
tgtttctaaa 
tggtaagaga 
cccatcgtgt 
atagaagtaa 
atataaagct 
tgttgtagtg 
gatttggctc 
tggtctgagc 
tgctgataag 
ggtgtacaaa 
actgaaaaac 
aaaatgctaa 
ctaagtcata 
ttaatgactg 
ttctgcaatt 
gcaggagggc 
gtttttatct 
tggctgtgac 
gggcaggtgt 
tggtcaaaga 
agcacctgtt 
attgtgcatg 
tgcttgagcc 
aaagaaagaa 
ttctctcttt 
gagcacaggc 
cagaactcat 
aaaggaaaac 
agactttgtc 
tttatcctga 
ttctttataa 
ttcttaacca 
tttttttgtc 
aaaaagtaat 
tgcacactgc 
gatggtgttt 
tcatttctat 
ctgcataccc 
cttgaggctt 
ccagccagat 
tggacagatt 
tactgtgttt 
ccagtagcag 
cacctcctac 
acccacagaa 
aaaagcgttg 
aggaaatggg 
tgccccatga 
tctaggcagg 
tgctgaacac 



ccaattaaca 
atgaaaaatt 
aaccctcgtc 
gcaaacatac 
tatagttaac 
tccaacacat 
cttgtttgtt 
gataaatgct 
atgtaacaaa 
gagggctggg 
ggatagcttg 
acaaaaaata 
ctacttggga 
cctacgactg 
agaaataaat 
aaggactttg 
gataagcaaa 
aaatcatacc 
tctggaataa 
ggcagccttc 
aggcaagacc 
ttccccagca 
accggaatgg 
acgatctcgt 
aaaaccaaca 
agtaggcatt 
gtacataatg 
aagttcttta 
acacaggctg 
taacctcgct 
cacagagttg 
acctgtcacc 
cccctttctc 
ggaagacctt 
ctgtcctgcc 
gtagcatgag 
caggagtttg 
agaaagaaaa 
ggaatgtagt 
taaagcatct 
tatgttgtga 
cacccatcat 
aacatgaata 
cttttataaa 
ttatcatttt 
tttcactgtc 
tattgcatta 
aatattttca 
tcccagagat 
ataaacttct 
ctctttgatc 
tcttgggtga 
gttttataaa 
ttggccactg 
aaattaattg 
gatcctatag 
gaggctccat 
aataccccac 
tttaagggga 
ttcacatggg 
ggagggttgg 
tagagggttc 
gcttcccaat 
agatgttacc 



ccccgacctg 
tatgggccca 
attaataagc 
aattagatag 
aacaatgtat 
agaaataata 
cttggaagct 
tgaggtgata 
atatcacacg 
cacggtggct 
aggccaggag 
aaaataaaaa 
ggctgatgca 
caccactgca 
taaataaaat 
gggctcaaca 
tgccttaacc 
cacttcaagg 
tgcctggcac 
tgaatctgtg 
tggggaaggg 
gagtgaggct 
gaggctggag 
atgactcctt 
tatttaatga 
cacatgttta 
tgtgagccac 
aacatcaaca 
gggttgaaac 
gaatctcaga 
ttgtgaagat 
ccactgatct 
cctcaccact 
tcctgataga 
agaatctccg 
cctgtaatcc 
agaccagcct 
gaataataat 
aagtgtacct 
tgactttatt 
aaataatttt 
tctcccaacc 
cttactttac 
caaatatgat 
gtggctgcat 
tggggaaatg 
tattcttaag 
aggctcttgt 
gtaggaacac 
gctaatttaa 
attaacaggt 
agtagtcatc 
tgtgagcgag 
aggctcctga 
gagaaaatgg 
tgggggcctg 
gtgtccccca 
atacatgtac 
aaggaattgt 
atcacctaat 
agagttgtgc 
tcaaattaag 
gcttccccaa 
catagcacct 



taatgaatga 
ctttggagtg 
ctgggggtgg 
aagtaataag 
tgtatatttc 
aatgcttgtc 
aagcagggtt 
gatatcctaa 
tatcccataa 
cacatctgta 
ttcaagatca 
cgaattagcc 
ggaggattgc 
ctctccagcc 
aaataaaaat 
gtactagcct 
cctgtgtgcc 
gtcattataa 
acagtaggag 
tcctctttgt 
cagagactga 
gggaaaggtc 
cataaggcag 
tatactgtta 
atgattccaa 
aaaattgagt 
agctatcccc 
tacaatgcca 
ccagcttttt 
tgtctagtct 
tcaataaaat 
ccagagttga 
ccgcatgcat 
ggaggaccat 
aaggagctct 
cagctactca 
gcgcaacata 
agtaataaat 
aataaatgtg 
ctataagcaa 
ccaacattaa 
ttcaataatt 
atggtcgcaa 
gttataaacc 
aatattgcat 
gaggataatg 
atcaaatccc 
tatattttac 
agacgtcatc 
taagtatgaa 
tgaactattt 
cacttccttc 
cacttcagag 
gcaggggaat 
gctgagaggc 
aactggggca 
tattagagct 
tcactctccc 
tctgtcctgt 
gaagggatgc 
aaaatgcaac 
agagtctaca 
ccccacctcc 
tgcaccatga 



gagtgccttt 
catggtggag 
ggggggagaa 
ttctaatgtt 
aaaatagcta 
tgcggccatg 
gaacctggtt 
ataccctgtc 
atatgtacaa 
atcccagcaa 
gcctgggcaa 
aggcgtggtg 
ttgagcccag 
taggcaacag 
aaaaagactg 
tgaaccctgg 
tcactttctt 
aaagccaata 
tttaataact 
ccactaatgg 
gggcaactgg 
tgggagacag 
ttcagttttt 
atgttttcat 
ggggattctt 
tgatttaaat 
aaaatcatga 
attccagaat 
tgctaactgt 
gtaaattgaa 
cacaacatgt 
ttcggctgat 
tcctcccgaa 
tcttcagtca 
cagtaaaatc 
ggaggccgag 
gtgagaccct 
cacctgtgca 
atcattgtaa 
taaaagagga 
caaagaacat 
ttcaattttg 
tcagtgttca 
ggtctccatg 
tgactatgtt 
ccagggtcat 
agcagtgaga 
tagattgttt 
caaccttgcc 
atgctatcct 
tccaagtatt 
acctgtttat 
tcaatagaca 
gcatgatcaa 
agagatatgt 
acggcctgag 
tgcggcactt 
ttgcaaatct 
tcacttactg 
catccccaac 
agggaatcat 
gcaactaatc 
atcctagact 
ttgtttgatt 



112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114840 
114900 
114960 
115020 
115080 
115140 
115200 
115260 
115320 
115380 
115440 
115500 
115560 
115620 
115680 
115740 
115800 
115860 
115920 
115980 
116040 
116100 
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agcacctccc acagtagact gtgtttctga taggtcagca acatttgctg agcacctact 116160 

ctgcagggct gtgccaggtg cacaaaataa acaaagccaa agacaacatg gaccctgaac 116220 

tcagcaagtt cagagtcaag tgggataggg aggctctctt cactggaagg taactccaag 116280 

aaacaatggg actcaacttt ctaaccaaga gaactccagg gagctaaaat tctgacttct 116340 

ggttaagact ggtgtggagc ttcattaaag aagaaaagat tcacccagac ttgagttcat 116400 

agcctggctt tgcagctttt aagtcatgta acctttgatg aagttatgtg acctctccac 116460 

ccagctgccc ctaacacctt gcaggggcag ggctggagtg caaagggagg cactggtacc 116520 

acagcctggg aggcaccacc ccactagtgc aagccgggca acctctgccc ccaaggcatc 116580 

cctagcctcc caactgcaag catcaatctt gcacttggaa aggaacctca cctttgaaat 116640 

ctaggttcaa atttagaatg atccagctcc ttgaagttct atacagaaat acagccagca 116700 

gccaggcccg gtggctcacg cctgtaatcc cagcactttg ggaggctgag gtgggtggat 116760 

cacttgagga cagaagttcg agaccagcct gaccaacatg gtgaaacccc gtttctacta 116820 

aaaatacaaa attagccagg catggtggta cacgcctgta atcccagcta cttgggaggc 116880 

tgaggcagga gaattgcttg aacccaggag gcagaggttg aagtgagcca agatcgtgcc 116940 

attgcactcc agcctgggca acaagagcga aactccatct caaaaaaaaa caaaacacac 117000 

gcagcttccc tccacttccc aaccacagct ccatctcaga caacaagggg cctcatgtcc 117060 

atgcacatga atatccaacc aacatgtcta aggcccaacc acaccctctc caaacatctg 117120 

ccccttggcc accctttggc catgggttca tgcactggca gaaaggtagt tcagagaaga 117180 

agcccacaaa gggccgggaa gtccacttgg gctttttgag attccagggt ccaggataac 117240 

ctaagtgtgg tctagaagag agatgcagct tctgggaggc acattccttg gtcttaggga 117300 

cttcttgccc ccatggaggg aaactggcta gatgagggcc aaagcagagc cctctaaagc 117360 

acagggctca gggaaggact ctttttgacc agatctaaga gcagcactac ctctctgagc 117420 

ctgtttctcc atctgtaaga aggggacatt aatagactct ccccgctaga gttactctac 117480 

atcagccagc acacgtaagt tcatgacatg aagcaagggc t.taatatata cccgttgtac 117540 

tataaataat aggccaggcg tggtggctca cacttgtaac cccagcactt tgggaggccg 117600 

aggaggatgg atcacaaggt caggagtttg agatcagcct ggtgaaacct catctctacc 117660 

aaaaatacaa aaattggccg ggtgtggtgg cgtgcacctg- tagtcccagc tacttgggag 117720 

gctgaggcag gagaattgct tgaacccggg aggcagagtt tgcagtgagc caagatctca 117780 

ccattgcatt ccagcctggg tgacagagca agactgcatc tcaaaataaa taaatacaca 117840 

catacataca tacatacata catacataca tacatacata catacaatac atggacaggg 117900 

accctaaaaa tgagacaggg aaagagaaaa acatgttctg acaaccttgc cctttatact 117960 

aatttaggtt ttcttgcctg ttttagaaag ggcctggaca ggagccctgt tcccctcagg 118020 

ccaggcagaa caaggtgtgg aactcactgt ggaagggttc tgggtgacaa gtgcagcccc 118080 

gtccctccac ctcccagcac agtaggcagc acgtgtctcc attgactggc tcaggagcag 118140 

gcctggtgac cagtgggaga gctgaggagc ccagggtggg gtctgaagga atccctagaa 118200 

aatctgattt tcccccaggg cccacatcac gtgcccagag ctgggaaagt ggaggcagca 118260 

tgggatctag ctgagaggct ccatttttgg tagcttctag tttgggagtc acagagacac 118320 

ctggatgata cgaagatgta gctttgcagg actctctaga acatggagtc caagatattc 118380 

ccttcaatga tgggacactg aagcccacag aggagaggtc tgtcccagtt actcagccat 118440 

tcggaggcag agaccaggct agaactcagg acttttaatt tggaccagga ttccttttac 118500 

cacagtgggc agccctagca agtgccaggt agggtggaac tgtgaaggtc atccgagggg 118560 

tagtacacgt gggtaggaag tcatatctaa gaactgaccc ccagacctgg ctctgccact 118620 

cactccttat gagaccacag gtgctgggtg cagtggttca cacctgcaat cccagcactt 118680 

tgggaggcca aggcaggcag attgcttgat tccaggagtt cgagaccagc ctgggaaaca 118740 

tagtgagacc cccacctcta ccaaaattag ccaggcgtgg tggtgtctgc ctgtagtccc 118800 

agctacttgg gaggctgagg tgggaggact gcttgagcct gggaggcgga ggttgcggtg 118860 

agccaggatc atgccactgc acaccagcct ggatgacaga gtgagacaga atgacacact 118920 

gtctcaaaat aaataaataa atgacagcag atcatcattt ttctttctgc ctctagactg 118980 

caatgcctat ttctccaggt agtcactagg ataaaagtaa aaataatatt atcagcattt 119040 

accaaataca gggtcagcta ctctgttatg ttctttcatg ctttgtttct tttaagcctc 119100 

aaacaactct atgagctggg aacaagtatc gtccttcttc ctcccatctt atttatttat 119160 

ttatctatgt atttatctat ctattcattt atttatttat tttgagacaa ggtccttcta 119220 

agtcaccagg gatggcctca aacttgagct aggaactagt gtcaccaccc cccaatttct 119280 

tttattgatt gattgattga ttgattgact ggttaatttt gaggcagggg tctcgctaag 119340 

tcgacagggc tggtcttgaa ctcctagtct taagcaatcc gcccgcctca gcctcccaaa 119400 

ttgctgggat taccaacacg agccaccatg cccagcccct cccatcttct gaataggaaa 119460 

actggggttt gaaaaggtaa gcgacttgcc caaggtcccc tccctagcta gagagcttca 119520 

gagccagggc acaaacccat caaagcctgt gctctcgccc attgagccac cggacctcgt 119580 

acactaaccg ccaagtgttc tacacagtga aggtgacaaa gaggtgaagg gaagagccag 119640 

ggaggttctg tggactcact cggtgggtat gcccagaggg aagggggatc ttgggtggca 119700 

cattgagagt agctgcgctg ttagtaagtg agaactcgga agtccagact catccagtct 119760 

gtgccaataa acccctcctt ctacctggtc tcctttccaa agccagctgt tctccagaca 119820 

atggggtggg cgggggcggg tgtcctcctc cttctcaggg aaaatccgac gctgagccca 119880 

tctccagaga tcttggcttc ccgtggggct gcagatccac ctagagccac cagagggcgg 119940 
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gccagcactg 
ccagtgtcca 
agcccaactc 
agaattactg 
acctcatttt 
cgatgaccgg 
attgtctagg 
ctccccccaa 
gtaagagggg 
atgagaccag 
gctggaaaca 
tgggggtgga 
gcggggatga 
ctttcctcgc 
cttgggggcc 
gacagcacac 
gtgatatttc 
acccagcgac 
acccccaaca 
tcggggtggg 
ctgggtgagg 
acgagagggg 
ccgaaaatgg 
ggaatgaaac 
acgagagaca 
gacagtgaga 
gagagagagt 
cgggggagaa 
agcagagaat 
cactgcattg 
aggggcttct 
cacctgcggc 
gctcccccga 
gagtcaagac 
ctgctgatac 
tccgcccctc 
gctttgggtc 
acatacacaa 
cactaagggg 
gcgcaaaacc 
ttcagcggct 
caaagggtaa 
cacgctcata 
gcacactggg 
ggacagaaat 
gggtgaccca 
cggcgttcct 
gcgaggacga 
gagggggtta 
cagcaaacga 
cctcgctgtc 
gactctcgag 
cagccttccc 
ggcccgcagc 
tcccgggccc 
gagtgggagc 
gggctcaggc 
gggagagaag 
ggtaacctgg 
tctcagcagg 
ggtgggacgc 
tgtccccgct 
actcagcaac 
atgtgggtag 



cggccaaggc 
agagtccagc 
atgttcaggc 
ccaaaatagt 
accaagcact 
gtagcctcac 
gtcacctgga 
ccctaaccgg 
tcagagtgct 
ccagccaccc 
cgacaatgga 
gggggtgagt 
gcagaggggc 
agctcgtccc 
cggggtcccc 
ctgagtcagc 
gctccccggg 
agattgtgcc 
ctccctcccc 
tcctggggaa 
gaagacggga 
cgggaaagga 
aaccagcagg 
tggggaagag 
gagacgcagg 
aagacagaag 
gagggaggga 
aaccagagaa 
gaatgggaag 
gaccctggca 
gtgctcgcgg 
tccagcagcc 
gggccccgag 
ctgggtcaac 
cgcctgtgac 
tttaaaaggg 
ctctcctgca 
aaaaacgcac 
catacacaca 
tcgcacagcc 
cacccgtgtg 
cgggcaagca 
tgcactcgat 
cttgaggtct 
gctggaggct 
gaagctcatc 
ctaaactacc 
tcgaacacag 
gaaagccgct 
gggaggagcc 
cccactctcc 
cggacagcgc 
cgccccgtcg 
ccgggcggcg 
tcccggagcc 
gccctccccc 
gcctgctgca 
cagggcgtcc 
ggaaggggaa 
agggcacttg 
agggggcagg 
ccacctgctc 
tcctcctgcc 
ggggctgtct 



ttgaagaccc 
cagaggccga 
atgatgtgtc 
aacgacatta 
ttctcatgcc 
cgtacagatg 
gaactctgat 
agctaggtgg 
ctacagagaa 
acccaccagc 
tgacacgagt 
cagcagataa 
tgcgactgcc 
aggaggagga 
tcccacccct 
ccgccgccca 
agccagcccc 
gcggctcatt 
gccgccgcct 
acgcagggtc 
gccccagaga 
aatttagggg 
gggcacccga 
ggggagagaa 
gagccccgcg 
accgggcagg 
gggaacagag 
atcgaatgag 
aataagacca 
cggataggaa 
gagcggcagc 
ccaaccccgc 
cctatcctac 
cgccctgcag 
caggccatga 
ggtagaggaa 
cccccgcggg 
acactcgcac 
ccgggcacat 
tcacgtttcc 
cacactcacg 
tcctgagtca 
cttgcacgca 
gggagtggaa 
gggacactgg 
ttctcctgga 
gcattccccc 
tcctccgggt 
cccgcctcct 
agcgagtgcg 
ctcggctagc 
agctagcggg 
cgccccgccc 
cagggtagag 
ccgcggggtc 
cgctgccccc 
aggtaagaac 
cctctttcag 
aagctcagcg 
gctgggagcc 
tggacccggc 
cgggggacgc 
cggtgcctca 
ggggaggtag 



agcacaccaa 

gtcctcgatc 

tgattctact 

gctacctacc 

tggggtgcct 

aggaaactga 

ctccagactc 

ggtggggaca 

gaccaaatgc 

cagccaccta 

tccctgccct 

ttatgggaaa 

tggagccagg 

gctccccccc 

ccccgaagag 

cccgcccctc 

actgcgctcc 

ccggggaagg 

ccaggccctc 

ctgtcctgcc 

cttttctttc 

cacagagagg 

gagccgaggt 

agggaggcag 

gggaggagga 

gaaacagacg 

agacagagac- 

aacgcgagaa 

acatttatca' 

aggaggagcc 

ccagggggct 

cagcgctgcc 

gccggggcgg 

cctttgtagg 

agggccagag 

gttcacgcga 

agataaggtc 

gcgcgcccat 

atttctttcc 

caccagctca 

tgcccccccc 

cacctgcaca 

caaactcttg 

ggaaaagtgg 

cgcgagggac 

aagttgggag 

aagaagggat 

cgcttaagcg 

agtggtcgag 

ggaaggagtg 

agcctgggca 

gcgcgggcgc 

cgtcccgtcg 

cgccgcggcc 

cccgccgtgc 

tcccccgagc 

gccagcggcg 

ggattgaggg 

ctggggccgc 

cgcgggcgcg 

ccggagcggg 

tgaggactcg 

gcactttctg 

gaggcgcaga 



agcccggcca 
tcaaaatgtc 
gggacaatca 
acccctccac 
ggagacttaa 
ggcacaagga 
aaatttccaa 
gcaaatgtgg 
attgtggcac 
aaagtcttca 
caaaaagctg 
ccgtgacacc 
gattcccgga 
agcttcgggg 
cgcgggcccc 
agcgtctgtc 
ggaggcagct 
acgccaaacc 
cccccaggcg 
tcctggaaat 
tgtttctacc 
agctgggggg 
gcccacgggc 
agacaccgag 
gagagacgaa 
agtagagaca 
cacgaaatat 
gaacgagaga 
agagccgact 
gcggcgcggg 
cagcagcccc 
tggccaccgt 
ctccgcggac 
gaagtgccta 
gggctccagt 
agccaacagt 
tcccctcccg 
ctcgcacccg 
acccatcccc 
gacatgcacg 
cccgcttccc 
agcatccttg 
catatactat 
aatcttggag 
gcggctgggg 
gggggaacag 
ttctctagaa 

ggggggaggg 

aaagggttaa 
ggggtggttg 
cacggacaga 
tgggcgtcga 
gggccgatgg 
cggccacgca 
atccggcggg 
atcgagacaa 
ggagagcgga 
tggggcagtt 
gcccccgccg 
tgcgaggagc 
gagggaggct 
ggccggctgg 
gccacctggg 
gggaaatcca 



agcctccagc 
taactgcaga 
ttgccaccaa 
acaccaacac 
tgcagcctcg 
aggggagtac 
ttcgtccccc 
atggggggag 
ctactgtaaa 
gtgggcacct 
gtagtctagt 
tgtataaggg 
cggggcttcc 
ttccgcctgc 
gggaaccgat 
tccgcatctt 
cggcaaacaa 
ccaccctgct 
caggccctag 
agggggagcc 
tgatccgaaa 
ccgagaaggt 
cgggagcctg 
acacacagag 
gacacagaga 
gaaaaggtcc 
gagtaagagt 
ccgtggaggg 
gtatgccagg 
cagcggggcg 
ggcaccgccg 
acccgaagcg 
gcgccgggcc 
ggtgatgggt 
gagaccataa 
cttctcccca 
gacacatcat 
cttgtaaatg 
aagatcgcaa 
ctggcggact 
caagcccgta 
cgcgcacgtg 
tcttatagtc 
ctgtcccagg 
gcgggggagg 
gacaagtcca 
gagtggcgcc 
gggcggggtg 
gtcggcaagc 
ggaagagctt 
cggactgacg 
cggccagccc 
ctcctcccga 
gcccggggac 
ctcagggagc 
gatgctgccc 
gggcatcctg 
ggggaggtgg 
ccagggctgt 
tcgtgaccga 
caggttccgc 
ggaagcgccg 
aagacaggag 

agtggccctc 



120000 
120060 
120120 
120180 
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
121380 
121440 
121500 
121560 
121620 
121680 
121740 
121800 
121860 
121920 
121980 
122040 
122100 
122160 
122220 
122280 
122340 
122400 
122460 
122520 
122580 
122640 
122700 
122760 
122820 
122880 
122940 
123000 
123060 
123120 
123180 
123240 
123300 
123360 
123420 
123480 
123540 
123600 
123660 
123720 
123780 
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tctggtagga 

aagggtgtgg 

aggagaactc 

gggggagggg 

ggtgaagacc 

agactatgga 

gggaccctgg 

gggatggtga 

atggatggga 

agggactggg 

tccacgtggg 

gtggctggtg 

gacagagggt 

gctcctgtgc 

accacggtgg 

cagctctaaa 

accaccacct 

tgcccgcagg 

gtctgggacc 

atcctttcct 

ctgcccccta 

cggcccaaag 

ttgatagcca 

cttcagcctt 

ttctggccat 

tctagaacaa 

ttccttgcct 

gcacctgggt 

cgtggtttct 

tgggcttgat 

ggccactgtg 

ttgacctcca 

tgcagtgcct 

ctgcagctct 

aggctgctac 

ctgagaaaga 

aactcacact 

taggtggcca 

ctccaccggg 

ggaggcctgg 

gggaagggac 

ggggagagag 

ccctggagtg 

cctcccctcc 

gttacagctg 

gcagcagctg 

ccactatcat 

aggagagtgg 

ctagggagag 

cggaaggaac 

ctgtctctcc 

gggctagcgg 

ggatcagaga 

ggtgactctg 

tccctgccaa 

tcatcacctc 

gaaatctcca 

tgtgcttcca 

accccacgtg 

aggggtgaga 

tgtcactgag 

acgcggagga 

gacactcacg 

ccgcacctgt 



gagatggagg 
gccagagact 
aaagatggag 
gagaaggaat 
aaggaagagg 
cagggaccca 
ggactggaat 
ctagagaagg 
gaggggctgg 
gattgggggt 
aagccctcca 
ggccagatgg 
gaggactttg 
tgccatttat 
ctgcttattt 
cctccgtcac 
ccatcccact 
ccccagccct 
ctgtgcccct 
gggaccactc 
ctttgggtcc 
agtggcctgt 
gagtcctgga 
ccgggatcaa 
atattctgga 
ccgcaactgg 
tgaccctgaa 
gcaatttctc 
gcggaggatg 
gtccctcttt 
tcatgtttct 
ggggattatg 
aggacaaaat 
agccatcccc 
aaagggcttt 
caatgaaggc 
tcccctcccc 
ccgaggtatc 
aattcacaaa 
gcctggaggg 
gtggcaagaa 
ggaaaggcac 
ctcgctccag 
tccttgcctc 
ctttgcacgg 
ttatggagga 
gggctggggg 
ggaaagagaa 
aaagaggagt 
cctgatgcca 
tggggcagcg 
ccatcagggc 
aggatttggc 
acaggaaggg 
cactttgctg 
agtagaactg 
gttctaagcc 
cactccacct 
ggtctaagaa 
agggccgaga 
tattacatca 
cagccagcac 
gtggggccca 
gtgctccccc 



gcgctagaaa 

gggggtgggg 

aaagtggtga 

aaaggttaga 

gaaggagagg 

gaatcctggg 

ggaaaaggag 

ggtatcagga 

agaggaggga 

gctgcccagg 

ttcccaaagc 

ggggtgcttt 

gaaggggagt 

ttatccggcc 

tggggtgtta 

ccgcgtcagt 

cctcagctcc 

tccctgaagc 

ccttccagag 

tgcaccagcg 

agttttcttc 

gggctgagaa 

cagtggtccc 

taccatgttc 

caggggtcat 

aagcctagga 

tagccaaaca 

tcctgtcttt 

tggccacctc 

tccaatctgg 

ggggctccct 

tgtgttgtgt 

aggtgcttct 

tgctgacttt 

cctctgtcca 

gagctgtaga 

agccctcaat 

cccctttcca 

atgaaggcca 

aaggagtggt 

agaaaagggc 

agctctctag 

ccccagcagg 

cagtgagagc 

cagtggcaag 

gcccccagca 

aggtgagccc 

gagaagtagt 

aggcaccctt 

ggggaatggg 

gtcagtccgg 

tcacactctg 

ttcttttggt 

aagtgccctg 

tgtgaccttg 

atgcttgaac 

ccaaacctca 

ctgcctggtt 

actcgggagg 

aacgcggagg 

cacccagcct 

gcaccgacgc 

aaagcgagga 

ctctggagga 



gaggatagtt 
tggggagggg 
gggaagctca 
tgggaaaagc 
aaagaccaga 
atggaggtag 
aatggaaaga 
accggcgaag 
agaggaggag 
gatgagacaa 
gcctgcctgc 
gagctcaggg 
gacagcctcc 
cggacgttgg 
cattctggca 
gcctccccag 
cacctcctca 
agcccgttgg 
cgagaggcct 
actctgccct 
tcctcaagtt 
ctttgtttct 
tcagtgaaca 
tggcctctct 
ctcttcttga 
ggtgaaagtt 
gactcagcat 
acctcaaggg 
ttaaagaaag 
gttccacctt 
tccttcatct 
tcaatgtata 
tggtttcctc 
agaggagggg 
tgaagtagtg 
gacctggtca 
ggtgttacct 
gctctgatac 
ggagtggagc 
ggtgctgagg 
acacactggg 
tcccccaacc 
cctggggcag 
cgctgcgtga 
ggccagaaat 
ccgggtcgct 
tggttggggg 
ctaatgtggg 
gccagctcct 
ccttgcctct 
aggctggggg 
gtgagcttgt 
gtcaagtcct 
gctgggcatc 
ggtaagtcgc 
cagaggaatc 
accctcatga 
cagatgaggg 
agaaagtaat 
tggtgtgaac 
acacacgcac 
agcaccgacg 
gcagcacact 
ggaacaccag 



ctactgattg 

tcagggggag 

aaggaggagg 

gtggagggaa 

tcaggggagg 

cgggaaagag 

tcagaaacca 

agggttggag 

gaagagaaag 

agaggcttct 

cacatttctt 

ccctgggggt 

gagggtgggc 

attctgcagc 

gagtgagaag 

gcccctgcgt 

gcccctgccc 

gtgtggagcc 

ctgctgcctt 

gtgggtgggt 

ccttcttcta 

gagccttggt 

gatacttttg 

tggctccctc 

ctcccacatg 

gcagagagag 

tgtggctggc 

cagtgtctca 

atcagagtgt 

gtactagctg 

gcaaattggg 

aagaagttaa 

ctaccctgct 

gtgagcagag 

gagggatgaa 

ggaggcctgg 

atgatgtgag 

tctgtgcatc 

cgtggtcctc 

aggagtgaga 

cagggcaggg 

ccccagtccc 

tgaacfcccag 

attatggatg 

ggcaacagag 

cttcagagag 

agacatggga 

caggtgggga 

gcagagttta 

gagattgcac 

aaagctctgt 

ggataagggg 

tagggaagtg 

aagagacttt 

ttgctctctc 

gaggggacct 

aacccactca 

gtaagagaca 

cgtgaaacgc 

gaatggaaca 

ggggcccggc 

cagcgccagg 

gggagtgtgg 

ggcagctggg 



agtgacagat 
agggatagga 
gagatggagc 
gtgggaccca 
gatgggaaga 
aatcaggact 
gagaaggatg 
acagggaacc 
gctgagagag 
ggtaaccact 
ctctcaggga 
ggctgtgagg 
aaacaggctg 
cgctgccgcc 
ctgtttgcag 
cactggcatc 
cctcagcatc 
cttgcttctc 
tccagggagc 
agcctggatc 
caggggcctc 
actccaaggt 
gctctggaca 
ccctggtcag 
taatcactac 
ctggagtccc 
ccagccctag 
cacattcagg 
ctctctgaca 
catgacctga 
ggccacaata 
cctgtacaaa 
gtactctccc 
agggtggggg 
atgaaggctt 
ggtgctcaga 
gggtcggctc 
ttgccccagt 
gggagagaca 
acagggggtg 
actgagggcg 
accacctctg 
agccccctcc 
agctccttgg 
tcactgttat 
cctgcaggga 
acaagatgga 
gcaggagagt 
ccctcaaggc 
atccttccct 
aatcctccag 
taggattaag 
gagatcagag 
tctggccctt 
tgagctccag 
ttgcggcttt 
gggtccccac 
ttgctcctcc 
cgcacggggg 
gcagccgctg 
gctcacacac 
aggggccggg 
atcttccacc 
atgccagcgc 
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cacactcggg 
gagcacctgg 
gcaacctcag 
gccttggaga 
cccaaggagc 
agcctaggac 
cctggctttg 
aggagtcacc 
tatcagctat 
tatgagtaca 
catcagtatc 
aaaaaaaaaa 
gccaaggtgg 
gaccccatgc 
tgtcacccag 
gttcacacca 
atgcccggct 
aggatggtct 
attacaggcg 
ggcacgggcc 
agtgatccaa 
caagaccctg 
agcctggggt 
aggactggct 
ggctctgact 
tggcaagacc 
ccaagttact 
atgtctggag 
tagaagacag 
ccggcccaac 
tctagacaag 
ccaatttgca 
tcagaaagtg 
catgggagca 
ggtcttgttg 
aaactcctgg 
acatgccacc 
tgttgcccag 
aagtgctggg 
ttggggggct 
tcccctcccc 
ggcggccccc 
ccaggccaac 
cctgcagaac 
cctgtggctc 
agccctggag 
cttccagggc 
gcccggcaac 
cctgctccac 
ctctctgtgg 
acccttcctg 
ctccaggggc 
tctctctctc 
gtctatccct 
tgcctcttct 
cgtgtgtgta 
cccccagggt 
gcccacatca 
aagttgcgct 
gattcccatt 
' tgttatgcag 
attgttatta 
aaatcatccc 
aaaatcttcc 



gcctgtcagt 
ctcacgccac 
gagcgtgaca 
ccccacactc 
cccaggatgt 
gttgcttgaa 
ggggtgcagg 
tggaaatcat 
agtgacggca 
gacacctgca 
ataataagga 
gctttggctg 
gaggattgtt 
ctacaatttt 
gctggagtgc 
ttatcgtgtc 
aaattttttt 
cgatctcctg 
tgagccaccg 
tgtggtccca 
gctgcaataa 
tctccaaaag 
gtaaaatgca 
ggtggcctca 
ctgcaagggc 
agaccccaga 
tccagtcatt 
acattttggt 
agatgctgct 
tgtcaatact 
gtcaaacatg 
gtgctttcta 
gaagtggcca 
ccagggctct 
tgttgcccag 
gttcaagcga 
atgcctggct 
gatagtctca 
attacaggtg 
ggcccccagc 
acagctcccg 
agctgcccca 
aacttctcct 
aacctcatcc 
ttctccaaca 
gagctggacc 
ctggagcggc 
atcttccgag 
ctacaggtga 
gcccctctgc 
cctcagcatc 
tttacttttt 
tctccctcta 
ttccatccat 
gtctgtctcc 
tctgtctctt 
cccctgccca 
cggtgaagta 
tctctccaat 
ttccaaacct 
ctggagaatg 
ttattgatct 
actccatttc 
cttggctaca 



cccatgcgtg 
tacccaaaat 
caacacacac 
aaaatcacca 
cagagtgcag 
gcagaaggtg 
ctcccttttt 
aagaaagtgt 
aaggccaggg 
ctccactctc 
tggatcatat 
gatgcggtag 
tgaggccagg 
tttttttttt 
agtggtgcga 
tcagcctccc 
tttgtatttt 
acctcgtgat 
cgcccggcca 
gctcctcagg 
gctgtgatcg 
aaaaaaagaa 
gattcccagg 
cttagagacc 
gaaaagtaca 
gtgagtccat 
ctccaatggg 
tgtcacaact 
aaatgcctta 
gaggcagaga 
caatagtgaa 
caatggcctt 
ggccacttga 
aggcctttat 
gctggagtgc 
tcctcctgcc 
aatttttttt 
aactcctggc 
tgagccacca 
tggcactcct 
cctcggcctg 
tgctctgcac 
ctgtgccgct 
gcacgctgcg 
acctctccac 
tcggtgacaa 
tgcagtcgct 
gcctggtcag 
gcctgccctg 
tccccgaccc 
tccatttctc 
cccttctgcc 
actccacaac 
cactgcctct 
cttcacacac 
tctgtgatct 
aaggcctttg 
gagagagaag 
tcactgggca 
gtcatctcaa 
gatgctctga 
aattattgtt 
cccaggaagc 
gagcctccgg 



cacacctggc 
cacagataca 
aaaaccacca 
acccctcagt 
aaacaagtct 
ttcagtcact 
ccccaggcaa 
agaggtcaag 
atgatgggag 
tagcccccag 
ccaaccttca 
cttatgcctg 
agtttgagac 
tttttttttt 
tctcggctca 
aagtagctgg 
tagtagagac 
ccacccgcct 
aaaattttta 
aagctgaggc 
taccactgca 
agaagttttt 
ctgtcccccc 
cgacccttaa 
ggaaagtaag 
ttcacacggg 
gtgactttgo 
ggaggcaggg 
tatagggctg 
aaccctgacg 
aacaggaatg 
ttggcattat 
ggctataacg 
ttttattttc 
aagggtgtga 
tcagcctccc 
tttttcttgt 
ctcaagcaat 
tattcagccg 
gccctggaag 
cctcctgctg 
ctgctactca 
gtccctgcca 
gccaggcacc 
catctacccg 
ccggcacctg 
gcatttgtac 
cctgcagtac 
cccccaccct 
tggcgtgcgt 
tctgtctatg 
tctctacctg 
cttcacctct 
ctcactaact 
ccactccgca 
cacgtgtttg 
cagctgtttt 
gcagagccac 
atgggacggg 
tgcaggggaa 
aaatggaagg 
tattgttgtt 
aataacacac 
ctggaagggg 



tgagcagcac 
tacacacatt 
ctaagcaagt 
ctctcccagg 
tcctcccctc 
gtgtgcccag 
aactgccaga 
ctagttccgg 
gccctgcacc 
gctctctggg 
aaagttactt 
aaatcccaac 
cagactgagc 
tttgatacag 
ctgcaagctc 
gactacaggc 
ggggtttcac 
cagactccca 
aaaaattagc 
aggaggattg 
ctccagcctg 
aagtaactgc 
aggaattctg 
ggcccctccc 
ggcactgggc 
cctcagatct 
cccccagggg 
tgctgctggc 
cccccacaac 
ttagtctttt 
aagagatgat 
tttttaatat 
ttgtcccctg 
tattttttcc 
tcgttgctca 
aagtagttgg 
aaagacaggg 
cctcctgcct 
ggtctaggcc 
cccacctagt 
atgctcctgg 
tccccgccca 
cccagcactc 
tttgggtcca 
ggcactttcc 
cgctcgctgg 
cgctgccagc 
ctctacctcc 
cagccccttt 
ccctcctctc 
tctcttttct 
tttaggtccc 
ctgcctctgc 
tgcctccccc 
tacaccccca 
ccttcagggc 
tctcacccac 
agccactggc 
agaagcccac 
gaaagaaaag 
aataccagta 
atgctgactg 
cctccaaacc 
gtgaaaatat 



tgcatttggt 
cacgcacacg 
gcaatttgca 
gtctctgaac 
tgccttcaaa 
ggaatgactg 
agaaaatccc 
cctagaactt 
cctattaaaa 
cctgcttttc 
tgggggaaaa 
actttgggag 
aacatagcaa 
agtctcgctg 
cgcctcccgg 
gcccgccacc 
cgtgttagcc 
aagtgctggg 
tgggtgcagt 
cttgagccca 
ggcgatggag 
gaatgaggag 
catagttcct 
ggcacaaaga 
accagtgggc 
ccaaagggtc 
acatttggca 
atctagtggg 
gaggaactat 
gacattaatc 
cattcttcaa 
atgagaagcc 
agcccccaga 
cctgaaacag 
ctacagcctc 
gactacaggc 
atctccctta 
tggcctccca 
ttttaccaag 
aagttctgct 
ccctgcccct 
ccgtgagctg 
agcgactctt 
acctgctcac 
gccacttgca 
agcccgacac 
tcagcagcct 
aggagaacag 
ctggtttcct 
tccccaggcc 
ctcttacatt 
ttgctgttcc 
ctgtctgtct 
atctgtcttc 
tgtctgtctg 
actctgcctt 
cctcaagtct 
atcccacaga 
accccttcta 
ggtaaatctc 
attgttattc 
tttgacacgc 
accctgagag 
ccaaattctg 
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ccctctccct 

ctagttatca 

caatagccat 

tacagtagtg 

gagggaaaga 

ttgcctaaat 

gccacagtgc 

ttttttttac 

catctcctag 

gtgcttggca 

aacgagcctt 

atgtgggtca 

acagtttatg 

ttttcacctc 

gatatagtca 

gtcttctcac 

tagggacatg 

tcatggtctg 

aagcacacag 

tttggacaaa 

ggatagtact 

tggagtgcag 

ttcctgcctc 

tttttgtatt 

gacctcaggt 

ccatgcccag 

atttagcaca 

actactacta 

atgctttcaa 

gggcctgagc 

ctaaccctct 

ttccaccaag 

gctggtgtgg 

acttgaagtc 

aaatacaaaa 

taaggcagga 

actatactcc 

aaaagcccat 

ctcaggccaa 

caggtggaca 

atggtaccag 

gaaaacacag 

aggtgaggaa 

ccaggcctgc 

gaatggggaa 

gtatctacca 

gtggccctgt 

tgttttcttt 

agtggcacga 

tcagcctccc 

tttttagtag 

ggtgatctgc 

gccgacactt 

ttttgttttg 

cgcaatcttg 

tccccagtag 

agagacgggg 

acccgcctcg 

ctgaatggtt 

ccacacctct 

ggacccatct 

aggagatact 

tagctatcca 

ccaaggtcac 



acttgaacct 

atctgctagt 

aacaacctaa 

aacagaacag 

agtaaaatgc 

agtgacagta 

caagcacttt 

ctcctcagac 

ggattttgaa 

tacggtaagt 

atttaacatt 

acttcaagga 

aagtgcactc 

cttgttaaca 

cctccaatca 

catcatgttg 

gaatgaatgt 

gctacaaggg 

tttggactca 

ttacttaacc 

taataataat 

tagtgcgatc 

agcctcctga 

tttagtagag 

gatccactca 

ccaataataa 

aggattaaaa 

ctactactaa 

aatctcctta 

ttgaatcccc 

ctgaacctca 

aatggctgta 

tggcccatgt 

aggagttcga 

attaggcagg 

gaatcacttg 

agcctgggtg 

agttcagtgc 

caacagcaat 

ttggggtctt 

gcatcatggg 

cctctgcctg 

acatgggctg 

cccaaagggg 

gggccaggca 

catgccaggc 

tctcccagag 

cttttttttt 

tctcggctca 

aagtagctgg 

agacggggtt 

ccaccttggc 

gttttctctt 

tttttgtttt 

gctcactgca 

ctggaactac 

tttcaccatg 

gcctcccaaa 

taaaaatagt 

aatttgctct 

gtgaaatggg 

actgagctca 

ttaacaccag 

ccagctagta 



ggaacgtgct 
tggaaaatca 
caaacatact 
accaaacccc 
acggtatatt 
atagcagtag 
ataggtatcc 
ctcagttttc 
agcattaaat 
gctatataaa 
ggtttcagtg 
ttatactaag 
agccacctca 
atggagaagc 
taaatctttt 
agcctcacaa 
tcctgaggcc 
gacagtactc 
gcaagaccta 
tctctcagtc 
cttttttttt 
tcggctcact 
gtaactaaga 
aagggtttca 
cctcggcctc 
tccttattta 
tgtaaggcat 
tactgagatc 
tcaatatata 
gatctactat 
cttactttat 
aataggaaac 
ctgcaatccc 
gaccagcctg 
cggggtggca 
aacctgggag 
acagagcaag 
tgaagaaatc 
caggacctga 
ctgaaatggg 
aaggaagcaa 
tgaactgcct 
gggtgagatc 
agctttggag 
ctgtgccagg 
aatgttccag 
catattccct 
tttgagacga 
ctgcaacctc 
gattacaggc 
tcaccatgtc 
ctcccaaagt 
tcagtcatta 
tgagacagtc 
gcctcacctc 
agacatgtgc 
ttgcccaggc 
gtgctgggat 
ctttatgctc 
atggctttgt 
gataacctgt 
cagcccaatg 
ggaggacacc 
aggaacatga 



tcctctgcct 

ggtcagtgct 

gagcacccac 

ctgccttcac 

ggaaaaatat 

ccgccaccac 

actctgccat 

tcatctgtac 

gcatgaataa 

tgcttgttaa 

aagtggccca 

gtcatgagtg 

tctcatttct 

tgaggctggg 

caaccattgt 

caacctggtg 

acacacccag 

tggagtacaa 

ggttcaaatc 

tccatttcct 

tgagacgacg 

gcaacctctg 

ttacaggcat 

ccatgttggc 

ccaaagtgct 

agaagttttg 

ttagcacata 

aaatactact 

ttagttattt 

tttctgactt 

ctgcaaactg 

gagttagtgt 

agcacttcgg 

gccaatatgg 

ggtgtctgta 

gtggaggttg 

actctgtctc 

atgttattat 

ggtcagcaaa 

aagtgtttgt 

cttcacacct 

ggtagggctg 

cgcagggtgc 

gaaactccac 

tgagttcatt 

gtgccaggga 

actcaagtgt 

agtctcgctc 

tgcctcccag 

atgtgctacc 

ggccaggctg 

gttggattac 

cagtggcctg 

tcactatgtc 

ctggggtcaa 

caccatgtcc 

tggtctcaaa 

tacaggcatg 

aagcagatca 

gcaagttatt 

accttggcga 

tctggtacaa 

aactgaagct 

cctagaattg 



catccagggc 
gatgatgcta 
tacgagctag 
agagatacca 
gtcttatatt 
ttagtgggta 
ttacaagcgt 
aatggggtag 
tttgtaaacjc 
aatactattt 
acttggactc 
agtcccagaa 
acagcccagt 
ggccctgaag 
cggtgtgacc 
atagggacag 
gaagagctgg 
ttgagcaggc 
ctggctccta 
catctctaaa 
tcccactcta 
cctcccaggc 
gtgtcactac 
caggctggtc 
gggattacag 
taaggattaa 
tgggcactat 
acaaattgat 
aggaggaatt 
atttaacttt 
ggaataatga 
atagaaagcc 
gaggccaagg 
tgaaaccctg 
atcccagcca 
cagtgagctg 
aaaaaaggaa 
gaccccatcc 
ggcttgggca 
tctctacgcc 
ggccttttat 
ggctgggaga 
aggtgtgacc 
caga'ggacca 
catcaacaga 
ttcaggagag 
agccagatga 
tcttgctcag 
gttcaagcga 
atgcctggct 
gtcttgaact 
aggtgtgagc 
catggttttt 
acccagctgg 
acaattcccc 
agctaatttt 
ctcctgaact 
agccaccgta 
gatctcagtt 
taaccactct 
gcaggggttg 
agtgagtatc 
cagcaaaata 
gcccaggtct 



tagtgcctaa 

atgataataa 

atgctaagaa 

ttcccatgag 

attcttattg 

cacagggtca 

gtgacatttt 

caagagcacc 

acttagaata 

taaaaaaaga 

catcctgaag 

attgcacctc 

tgggagatta 

accctataga 

ggaggcttat 

ttaggggcac 

cgcttgaacc 

tcatttttga 

tatatatgac 

atggcaatca 

tcgcccaggc 

tcaagtgatt 

acccagctat 

ttgaactcct 

gtgtgagcca 

aatgtaaggc 

aataataatt 

catgcattta 

tggagtcaga 

aagcaggttg 

aaataatacc 

catagttcag 

tgggtggatc 

tctctactaa 

ctagggaggc 

agatcgtgct 

aaaaaaaaaa 

tccattgact 

gaggggacct 

cctggcatga 

agaggagatg 

tgccacaggc 

caagatggag 

cagcttttca 

tatttactga 

aacagaaaca 

taaagacact 

gctggagtgc 

ttctcctgcc 

aatttttgta 

cctgaccaca 

caccgcaccc 

gtttgttttg 

agtgcagtgg 

catcttagcc 

tctattttat 

taagcaatcc 

cacagctggc 

tgaattccag 

gagcctcgat 

tgaggattaa 

caatgaatgg 

aaagcacagt 

gtctgactcc 



131520 

131580 

131640 

131700 

131760 

131820 

131880 

131940 

132000 

132060 

132120 

132180 

132240 

132300 

132360 

132420 

132480 

132540 

132600 

132660 

132720 

132780 

132840 

132900 

132960 

133020 

133080 

133140 

133200 

133260 

133320 

133380 

133440 

133500 

133560 

133620 

133680 

133740 

133800 

133860 

133920 

133980 

134040 

134100 

134160 

134220 

134280 

134340 

134400 

134460 

134520 

134580 

134640 

134700 

134760 

134820 

134880 

134940 

135000 

135060 

135120 

135180 

135240 

135300 
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agagtgcagt 
ttggacccta 
agataataat 
gtagcacaga 
gccgatgatg 
cacagagtag 
aattcataaa 
tcacagatga 
aagcagctca 
aagaacagga 
cagagagctg 
ggatgacttt 
aggggcgggg 
ttttaacagt 
ggatttggca 
gaagggaggg 
agaaattctg 
ctctggaact 
tgctcactct 
tgtcctactc 
agggattcca 
tgagtcaggc 
gggtggtggc 
ggacggatct 
agggcggaag 
tgggtgtagg 
gtggcaggca 
gggaacaggc 
agacctcctg 
cagtgaggtc 
ccccagtgct 
aagtcagctg 
ccactggccg 
caaggtcctt 
gacagcgcct 
cctctggggg 
ttcccccagg 
tattgtcata 
cttaacagca 
gaatattgta 
acagctaggg 
cttcataatt 
tctttatgtg 
ccccatttta 
taatggctaa 
ctcatggaac 
ttcatttaat 
tgcccacggt 
ccagtctcca 
gccacactgg 
ctcctgaatt 
aatacacccg 
aggttagagg 
cttctgccac 
tgcctcaggc 
gcggacctgg 
gagcacgtgt 
cagggcgtgc 
aacaacagcc 
ctgcggctca 
ttccagcgcg 
ggccgagacc 
acgcggccgg 
gaggccgggg 



tgttcagagg 

ggcgagtcac 

agtatccacc 

gcttggcaca 

acgatgatac 

acactcggta 

aatcagatgc 

aaagacccag 

tttgattagt 

tgctgtcagg 

tgaaccggcc 

agcagaggct 

ggagggctag 

ggagagagga 

acagcttgga 

gcaggacttt 

tgggccaaaa 

ttggtctact 

cccccaaccc 

ttgctgacac 

agcctacccc 

tgaggccagg 

agcagggggc 

caatgtggcc 

ccagatggga 

ctgggagtca 

tggggaggaa 

cccctgggtg 

gaacctgctg 

agggagctga 

ctcctctaaa 

atcatcagaa 

tgggccacac 

tctgcactaa 

ctaaggggag 

tggagctgat 

gagctcctct 

gtagccacca 

tttatcatcc 

catactgagg 

aaaaggcaag 

ggaccaaatc 

aaccatcctc 

cagatgagga 

caaacaaaaa 

tcatagaact 

ctctacatta 

ggcatcagtt 

tacttcacca 

cttccctggg. 

agaaactctg 

aaagtttagg 

aggcagaatc 

ggtgcacccc 

ccattctctt 

ccaacctgag 

ttcgcggcct 

accgcgcggc 

tggcctcgct 

acgctaaccc 

cgcgcgtgtc 

tgcgcgcgct 

gcagccgcgc 

cgcccccagc 



tctctggagt 
ttcacttctc 
tcatagggtt 
tggtaagagc 
ccatcctaga 
cagctctgtg 
agatgggatc 
gcccagagag 
gtcagagcca 
gaagcaggca 
agtggggagg 
ctcagcccag 
ggctgtgaaa 
aggaccggat 
tggggggagg 
ttgcagagaa 
cctggggctg 
cttctgacct 
ccggcccctt 
ttcataaaaa 
ctggatgggc 
gaggtgcaag 
ggagaatggt 
aagaggaggg 
gggggcgaga 
atgccctccc 
agcgtctccc 
aactggcctg 
accacagtga 
agtcctgctt 
aaagtgagct 
tcccctggtg 
acacccgccg 
tgtggcctga 
tagtaatgca 
gcccctcacc 
tttaaagaaa 
tttattgatg 
aaccctcctt 
aacctgagac 
ctggattttg 
cctgtgtgct 
tggaatcctc 
aacaggcaca 
caaaaacaaa 
ccacaaggaa 
tacagatgag 
aatgacagat 
aaccagaagt 
gaacctgtag 
ccccccgccc 
aaccactgac 
caaatccagg 
cttccctccc 
cttctgtgcc 
ccacctcttc 
gggcagcctg 
cttccgcggc 
gcccggcgag 
ctgggcgtgc 
cagctccgac 
ccgcgaggcc 
ccgcggcaac 
cgatccctcc 



tggaagccac 
tgaggctcca 
gtgacaatta 
tcaatcagtt 
ctgatgagct 
gaatgaatga 
ttaaagatca 
gtgcttggag 
agagctggga 
gggatgctgt 
caagggaaat 
agggagggga 
gtcaagagct 
ttgaaagcta 
aggcaatgga 
acaaaaggag 
tgggtcaaag 
cccgaggtcc 
atcgatcctc 
gaggaacccc 
ctggaagaga 
gagccagctg 
ggtgtcagag 
ctcttggcac 
acaggcagga 
ccaacctgag 
caggcagtga- 
agcagagtgg 
tgccctgcac 
ccctctctgg 
gggctgatgg 
agctggttat 
ccccccgctg 
ttaggtgact 
atgtggcttc 
ccaataccca 
agggacagga 
gttgactatg 
tagcctgctg 
tccatgaggt 
aactagggct 
gggcacgtgt 
agaacaaacc 
gagagatgac 
aattaaaaaa 
aggtgttcta 
gaaactgagt 
ccaagatttg 
tctgaaactc 
acatggggat 
caccccgctc 
ctcaccaata 
atgctatgaa 
ccggccaagg 
ccactccacc 
ctccacggga 
gaccggctgc 
ctcagccgcc 
gcgctcgccg 
gactgccgcg 
gtgacctgcg 
gacttccagg 
agctcctcca 
accctctacc 



gttccactgc 
tctcgtaatc 
agttactata 
acctgcttga 
ctgtaagcgg 
ggcacatccc 
cctatcctaa 
ctgcgcaagg 
gtttggaggg 
gttaagatfcc 
gtggtttttg 
gatagggagg 
tattaatgca 
cattcaagga 
ccccaaggca 
aggagaggag 
gcacctgaat 
cccaaaatgt 
tgaccataca 
atttaggtgt 
acaagagcac 
gaggcctgag 
gcagccgaga 
gctcagttcc 
gcacaggaag 
gcctccgacc 
gggagggaga 
atgctcctgt 
aagaggggag 
caagccctta 
gtgccaaggc 
aatgcagagt 
ttaattctga 
ccctagcacc 
cttcctctcc 
gcctagtagc 
cccaattgtt 
cacctgccag 
agggggttat 
taaaacttgc 
ctaagtgctg 
ccagcacttc 
caggaagtag 
tggcttggcc 
aaaaaaagaa 
agcaccttca 
cacagatatc 
aaatcagaaa 
aaactgtggt 
tcccaggctc 
agagatccgc 
ccactttttc 
tcaaaaggtc 
ccccagcggg 
ccacccagga 
accgcctgcg 
tgctgcacgg 
tcaccatcct 
acctgccctc 
cgcggccgct 
ccaccccccc 
cgtgtccgcc 
accacctgta 
gagatctgcc 



atattagctg 
tctgaaatgg 
taggatctgt 
caatgctgac 
gggtgcctgg 
agaactcacc 
gtcccttgtt 
tcacacagcc 
aggcaaggtt 
caaatggatg 
aaatggaaga 
ggagataggg 
tagagaacgg 
agtggcaacg 
gaggctcaga 
gttagaatca 
tccctaggat 
ggattacccc 
tctctgggtg 
tttgagtggc 
caggccatgg 
ccaggatttg 
aggttgaggg 
tgtagcgaag 
gtggaggctg 
aggctcctgg 
gccacagtca 
tctgagaccc 
gacctcaagg 
tctctttgag 
attagctccc 
ccaggaatcc 
accatagttc 
aggcaggtgg 
tcccctgccg 
agtactttgg 
actgagcccc 
atactgtacc 
acataataag 
ctaaaataac 
agcctgtggg 
cctcatatga 
gtatactcat 
aagttaagaa 
taatggctaa 
tacatgctgc 
ctgagtgact 
ggctggctcc 
cctgccaatg 
caccccaaac 
aggggatcct 
cacagcaaat 
aaccctttct 
gtctgcaccc 
tgacttgttc 
gctgctcaca 
gaaccggctg 
ctacctgttc 
gctcgagttc 
ctgggcctgg 
ggagcgccag 
cgcggcaccc 

cggggtggcc 

tgccgaagac 



135360 
135420 
135480 
135540 
135600 
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138840 
138900 
138960 
139020 
139080 
139140 
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tcgcgggggc 
ggtgaggacc 
tcccgaggcc 
gtgccccacc 
tcccgtccac 
cctccctttc 
ccccctgggc 
gccctcccca 
ggcagccgct 
cagagcgtct 
ctttggggga 
tcaacctcct 
gcacctggag 
tgtttgggaa 
acttgggacc 
ttggagaagg 
cgctcccccg 
gccaagagtg 
caaccctgag 
acagccctct 
gtcacataca 
ggcactgctc 
accaaggctg 
agcgtacaca 
aggtagaagc 
agggaccgag 
gcttgaggta 
gaagagctca 
gcacttggag 
gtggtcgctt 
ggccaagaat 
ttttaagaat 
gggaggctga 
ggcagaaacc 
ctatagtccc 
gactctgtct 
gcctagatgc 
aaatgatctt 
tagactccgg 
ccagggagct 
taatgggttg 
gcctgggcag 
ttatcattgt 
aagaggagtg 
acccctccca 
cccgagctct 
ccacccacgg 
gcacacccac 
ccccaaactc 
ggatgttcaa 
tagaaaggaa 
cctcccacat 
ttgtaattaa 
cccagccaca 
cattacctca 
atggaatctc 
aatttgcata 
taaaggtaaa 
ttctagcctt 
catactggcc 
aaaatttctc 
cagagccctt 
gcagcagaca 
accttgggaa 



gccagggcgg 

agcgagggga 

ctgcgctctc 

acctctgact 

ccggggctgc 

ccctcccagc 

tgtcctgact 

gttctggcca 

gacccgcact 

tccaccagca 

tgcctcagtc 

gacactggag 

agaactttcc 

aaaaaattta 

ccgaaagcgg 

ggagccgtcc 

ccatgaggag 

ggtgctggtg 

tggtctggcg 

tccagaccct 

cactcctggt 

cctcagggaa 

ggtttccgcg 

cctgcacaca 

catcagggat 

gaactcgggt 

caatcagaac 

ggctaagaag 

agaagtttgg 

gcacctgtaa 

ttgagactag 

ctagtttgga 

ggtggacaga 

ccgtctctac 

aggtactcag 

aaaaaaaaaa 

tagggcttga 

gagaagaaga 

ttaagggtgt 

cggagtctgg 

gattggaggg 

ggggagggga 

gaaatattca 

tgtgagcctc 

ccaccagccc 

cacacacact 

gtcccacaaa 

tcaatcagga 

tccaaacaac 

ggccatccag 

gggatttgtc 

tctgagtgta 

ctcctccctt 

gtctatccct 

tgcctggaat 

tccctccctc 

tgttatgatg 

acaccttagc 

ttctctcaca 

accagttatc 

agtctctcct 

tcccaaggcc 

gaggctcagg 

gcacagccag 



ggacgcgcct 

gcagatgtgc 

ggccgggctc 

gcggtgctga 

ggctccggcc 

tcctctcctc 

tgtggcagcc 

ttaactcttc 

cctaagggcc 

gagcctttgg 

agggccaggc 

gaatactttt 

tctccactgt 

ttaaaaaatt 

gatgaggtct 

cccgccatca 

gagctgaagc 

gcacctcttc 

cctgaggaac 

gttccttcag 

cacagacagc 

acacagcccc 

tacacttgcc 

gcaaccctgt 

gaaccaggtc 

gatcagaggg 

cttcatcttt 

gacttgggtt 

tagtttgaat 

tcccagctaa 

cctggacaac 

gtggggtgtg 

tcacttgagg 

taaaaattca 

gaggctgagg 

aaaatctagc 

ggaagcaggc 

gaagaaagga 

ggggtgggct 

ggtcttgcag 

acagccagac 

gctgctggta 

tgggccatcg 

aggaagaagc 

agacagaccc 

cgcaccaagc 

caggcaggtg 

cccagcagag 

ccagattcac 

cccaaccccc 

tgagattcct 

cccccattct 

tctctaattt 

tccttcccat 

ataaaagaaa 

ccacccagcc 

tcactcagga 

agagcatcaa 

atctcatccc 

atgtatgtga 

gaagtaggaa 

aagactgcac 

ctctggctcc 

gtttcagtcg 



actgaggacg 

cccggcgctg 

cccagccctc 

gatcgaagag 

ccagtcgccc 

cccggggagc 

ccaagagggc 

cccatcccaa 

cacagcggac 

aagctccccc 

tgaccctgac 

ctcctaagtc 

ggcccctgcg 

ctattatttt 

cagaatgtaa 

aagagcttcc 

cctgcattct 

tcttcatttg 

taagcctggg 

gaaacaagag 

cccaacatgg 

atgccagagc 

tccttggcta 

gggtatgtgg 

agagaagcag 

acaggaatcc 

ctggtgtgtg- 

gggatttagg 

ttggagctaa' 

ttgggaggct 

atatcgagac 

atgtctcaac 

tcaggagttc 

aaaaaattag 

cacaagaatc 

ttgggaggtg 

tgaggttctg 

gaagaaatca 

gggggagagt 

gggcttgtgt 

aagagctctg 

aattagaggc 

gtccagatgc 

ctgaagcaaa 

acggacgccc 

agagccatgt 

tcccctaaat 

atcacctcca 

caccttgacc 

accaatgctc 

ctatcccctg 

gcattttcgt 

cctgcatcta 

tctccccacc 

actgcgataa 

ataccgtggt 

gcctgtaatg 

agatccctca 

aaaccttcac 

aacccaccaa 

acctcttccc 

ctctctgacc 

ctccaagcag 

tctagaacag 



actactgggg 

cctgccaggc 

tgctttgcct 

gccagtgtcc 

caccttccct 

aggccgcctc 

gtgtgtggtg 

ggctggggtg 

accagagggg 

agggagcccc 

ccctgcttac 

taccctggac 

tggtgaagat 

atctactgta 

ggattgcagg 

tggtggctgg 

aggtgaggcg 

tccaggggaa 

gaagacctgc 

cagttctcct 

ctttgggtaa 

aaacacctta 

agtgcccttg 

tctctctctc 

gtttccaaac 

caaattggga 

gaagagaggc 

ggtgagtctc 

gaatctagct 

gacgtgggag 

tgagtctctt 

gtctgtaatc 

aagaccagcc 

ccaggcgtga 

actccagcct 

ggaatagaaa 

tgattctggc 

gcatctaagc 

gagagcagct 

caggctggct 

gtggaggagg 

aggctgtcca 

tatttcagaa 

gccactctcc 

atcacgtgca 

agcacgtgca 

tctgacatgc 

gcgatctcac 

cacacaccct 

tgatggggaa 

aaaaaagcaa 

ctgccagaca 

ttcagttacc 

acctccctgc 

cctcctcgct 

gaccagattc 

gcttcctaat 

gagtctggta 

tccagctaga 

ccgactttga 

cctcctcaga 

atatacaggg 

acggctgccc 

agaatgagca 



gggctacggg 

gcccccggac 

cctgctcctg 

gatccccgct 

ggccttgctg 

tccttgcctg 

gctcagccct 

gggcccccca 

cttttgtctg 

acccaggacc 

cctagtcccc 

actttttagg 

caaaagaagt 

agatttgttg 

gccaggaggg 

aggtggtgtg 

cagtgtggca 

gagctgcagc 

tgtctggtta 

gcaaggagga 

atgtgaacaa 

gcaaacagag 

tgcagtgcac 

agctcctgtg 

aggctagaag 

tgcattactg 

tggggactgg 

atcagactga 

tgggcagggt 

gatcacttga 

aaaaatgttt 

ccagcactct 

tggccaacat 

cggcgggtgc 

gggtgacaga 

gatagagggg 

tagggaggtc 

ctgaggcagg 

ggtcagaaac 

gtgaggaggt 

ggctgctggg 

ggtcatagaa 

cagtgaaagc 

accaaccccc 

cacccacact 

agcacaccaa 

acactgacat 

atgcgcagac 

gagataggag 

actgaggcca 

aattcattca 

cccagcctag 

cagtccccca 

tccagctact 

ggtttcctac 

atctgatcaa 

gcctataggg 

ccaactgctt 

acgtttgtat 

gtgcccccct 

tctcagactc 

gttcttcaaa 

ccgactggcc 

tctaaccgcc 



139200 

139260 

139320 

139380 

139440 

139500 

139560 

139620 

139680 

139740 

139800 

139860 

139920 

139980 

140040 

140100 

140160 

140220 

140280 

140340 

140400 

140460 

140520 

140580 

140640 

140700 

140760 

140820 

140880 

140940 

141000 

141060 

141120 

141180 

141240 

141300 

141360 

141420 

141480 

141540 

141600 

141660 

141720 

141780 

141840 

141900 

141960 

142020 

142080 

142140 

142200 

142260 

142320 

142380 

142440 

142500 

142560 

142620 

142680 

142740 

142800 

142860 

142920 

142980 
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tggggagagg actaggacac cagatgataa ggtttataag cccttaagcc tctaaggttc 143040 

ttacacccag agtagggggg ggacggttct cagccctgtt tccctagctg cgggctccca 143100 

attttcgatc cctaatccga gaggaactcc tctccaatga aatacagact tgggactctc 143160 

aggacactgt ggaagggaaa tttcccaaca gactctgaga gtccaggagg ccagggatag 143220 

accaggtggc aggcccaagg tccagctggg gtcaggtttc tatatgaatt tttaatgctt 143280 

ccagatagac ttgtcagatg ttctgaaaac tgagcatctc ctttcacctc tgtacatgat 143340 

gcccttctcc aaccccattg cccctgcagg agggcaggcc tgggacagat attcagtggc 143400 

ctctggagaa acggttttgg gacagtagaa gggtaaatga cctagttatg ttcccactag 143460 

taagctgtgt gaccttgggc aagttactta acctctctga acattagagt tctgtgggtt 143520 

tgtttttgtt ttgtaagctg gggacaatag tgccagccta aatcaatttg ttgtggggac 143580 

tcagtgcaat agcccatggc aaagtgacct acatgcttgc tgttattatt ctctttcctc 143640 

aagttctgcc tccctcttcc agcttttctt ccaaccccaa agatgtctct ggctattgct 143700 

tcgaaggtag gaactttggt tggttctccc ctttctcttc aggcccaaac tccccacctc 143760 

aagatccttt ggcctttgta gaaacttcag gtgaggaggt ggcagagaaa taagaaagtg 143820 

tgcaaggctg gtggagtgag agaggaggat agatggcgaa gccctagcag aggggaggga 143880 

agtgggcagt ggagagagg 143899 



<210> 16 
<211> 215980 
<212> DNA 
<213> Mus sp. 

<220> 

<221> modified_base 

<222> (1001) . . U100) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (2123) . . (2222) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (3728) . . (3827) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (5168) . . (5267) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (7481) . . (7580) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (8849) . . (8948) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (10375) . . (10474) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (12270) . . (12369) 

<223> a, t, c, g, other or unknown 

<220> 
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<221> modified_base 

<222> (13438) . . (13537) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (15902) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (15939) . . (16038) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (18223) . . (18322) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (20974) . . (21073) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modifiedjoase 

<222> (24403) . . (24502) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (27574) . . (27673) 

<223> a, t, c, g f other or unknown 

<220> 

<221> modified_base 
<222> (30892) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (30901) . . (31000) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (34443) . . (34542) 

<223> a, t, c, g, other or unknown 

<220> 

<221> raodified_base 

<222> (38205) , . (38304) 

<223> a, t, c, g, other or unknown 

<220> 

<221> raodified_base 
<222> (42373) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42386) 

<223> a, t, c, g, other or unknown 
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<220> 

<221> modified_base 
<222> (42393) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (42461) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (44809) (44908) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (51380) . . (51479) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (56740) 

<223> a, t r c, g, other or unknown 
<220> 

<221> modified_base 

<222> (56765) . . (56864) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (62818) . . (62917) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (68518) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (68534) . . (68633) 

<223> a f t, c r g, other or unknown 

<220> 

<221> modified_base 

<222> (74552) . . (74651) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modifiedjDase 

<222> (81446) . . (81545) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modified_base 

<222> (88519) (88618) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (93791) 

<223> a, t, c, g, other or unknown. 
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<220> 

<221> modified_base 
<222> (93794) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modifiedjaase 
<222> (96565) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (96570) . . (96573) 

<223> a, t f c, g, other or unknown 

<220> 

<221> modified_base 
<222> (96579) 

<223> a f t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96590) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modifiedjaase 
<222> (96596) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96602) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (96616) 

<223> a f t, c, g, other or unknown 
<220> 

<221> raodified_base 
<222> (96629) 

<223> a, t f c, g r other or unknown 
<220> 

<221> modifiedjaase 
<222> (96633) 

<223> a, t, c, g, other or unknown 
<220> 

<221> raodified_base 
<222> (96668) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified__base 

<222> (96715) . • (96814) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (104447) (104546) 

<223> a, t f c, g, other or unknown 
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<220> 

<221> modifiedjaase 
<222> (114521)"** 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (114527) (114626) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (127063) . . (127162) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 

<222> (139133) (139232) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified__base 
<222> (151051) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (153242) . . (153341) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 
<222> (164706) 

<223> a, t f c, g, other or unknown 
<220> 

<221> modified_base 
<222> (164708) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 

<222> (164710) (164809) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 

<222> (182242) . . (182341) 

<223> a, t, c, q, other or unknown 

<220> 

<221> modified_base 
<222> (192158) 

<223> a, t, c, q, other or unknown 
<220> 

<221> modified_base 
<222> (192192) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 

<222> (198842) . . (198941) 

<223> a, t, c, q, other or unknown 
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<220> 

<221> raodified_base 

<222> (199437) . . (199438) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modified_base 
<222> (208276) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modified_base 
<222> (215974) 

<223> a, t, c, g, other or unknown 
<220> 

<221> modifiedjaase 

<222> (215976) .. (215977) 

<223> a, t, c, g, other or unknown 

<220> 

<221> modifiedjaase 
<222> (215979) 

<223> a, t, c, g, other or unknown 
<400> 16 

ttgggggtat aaacccagaa gtgggattac tgcaccatac aataatcctc taacttcaag 60 

caatttttcc acaatggttg tatcatttta cattcccact ggctacgaga agggttccca 120 

cttctacaca tcttcaccac catttctgtt tttgtttttg agtaacagct gcctaatgac 180 

tgtgaagtgg tatcttatct cagtgttgat ttgcatttct ctgatcatta atgtgggaag 240 

gcatcgtttc atatgtttat tggctgtttg tgtatcatct tctttggcga tgttgattca 300 

agttatttgc ttgttttttt aattggagtt ttaaaaaatt gttgttgagt tgtgggagtt 360 

cttcattagc tctgcatatt aataccctga tgaaaatgat taacaagtat ttgcttccat 420 

tttgggggct tccattctgg gctgttttta ttcttttgat actcttttga ttctcaacag 480 

tttaatctga ctaaaattca gtttatttct tcttttaatg gccatgctat tgacacatcc 540 

cgtaatcact gccaaatcca gtcatgaaga gtttctttca agagatttat agttttagct 600 

ctttaagttt gtcatgtctg tttcacttaa ttttgtatag tgtacaaaag tctaacttca 660 

ttcttttcta tatggcttgc tactagtata cgaagagcta aatttctctt tccttgagtc 720 

tcaacctctg atgtgtagca atttcttcag aggaaaacat ggtgggaagt tccttaaaca 780 

taggatgctc catggaggtg aaatagttca tcctacaggg aagcttgtta aacacaggaa 840 

gtacatactc agcagctcta gtaagtgagt gaaactgact ggaggcacta ggtccctcct 900 

tccctacgca tatagaagct gtaaggattg ggaagagata ctgtcaggtc agctcagctg 960 

ctgcccggaa gaagctcaga cccactggcc tggctccaag nnnnnnnnnn nnnnnnnnnn 1020 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 

nnnnnnnnnn nnnnnnnnnn atcactcttt actcaggcca cctacacgct gtttatagcc 1140 

tgcctttgtc tctttggcta tacttcctgt ttatgtctat gcctcccctc tttctttttc 1200 

tttctcttct cttctcatct catctcatct ttcttcaggg gggagcctgg tctagaactc 1260 

acaaagattt gactgtctct gtctccttgc actaattaaa aaatctttta caagcatctt 1320 

ttagcaattc ttacagggaa attttggaat gttaaactct gattgttagc gggctgaaga 1380 

taacaatagc tctgatgata aattgcttgc caggcaagtg tgaaaatctg agtttgatcc 1440 

aaaaagccgg gtacagaggc caaagagtcc ataatcctag taggggcagg aatcagggat 1500 

gggtgggtcc ctggggtttc ctggtttgtc agcgtagccc aattgggaat agccaggttt 1560 

cagtgaacga tgctttctgc aagctgagag aggtccttgt tcaatctctg tgacccaact 1620 

ggagggagaa gagagccagc tctccagaag tggtcctctc aactttgtgc atgcatgtcc 1680 

atgttcacac agggaatgga taatgcttaa aaggaagacc ggcagggggt tggtaatgca 1740 

cctcctttgg tgacatgctt tcctcttgtt catgctgctc caggtgtggt cggcagcacc 1800 

aaaaaccagg tgtatgtttg taatcccagt attctctggt cgtcagtagg aaatgaaaag 1860 

cgaggtcatc ttcgtataga gttagcaaac tctaagccag cctcggctac atgagacttt 1920 

gtctcaaaac aaaggaaaaa tcaaggagga cggctcccga gcactgtcac ctgaagctga 1980 

cctctggcct ccacatgcat gtgcgcaaac acatgtcctg cacaaacaca cagacacccg 2040 

catctgctcc ccgacaaaag aacctgaaac cagtatactt tgagaatttc ccattcatag 2100 

ttaccattgt gtgttccttg tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2160 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 2220 

nnggtggtgc ctttctctta cccagtctag aagggctgga ggcagggtgg atggggcact 2280 
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ttgaactccc acctaggcaa aaaacccagt gatctctggg ccagtgtgtt gtttgcaagg 2340 

gaataaggta gagagccgcg gaggaagaca ttgggggttc tatgagtatg tgaaggggtg 2400 

cacacaccac acacacacat tttttttgtt ttaaatttac aaacattaaa ataggctgta 2460 

atgtggctca gtgggtagaa aaacctgctg tctaagcctg gtacgagttc aatccctgac 2520 

aagctggaag gagacaacca accacaactg ctagcagcca gaagcactgc ttgctaacac 2580 

tcaagagagc ctggagtgga agacactgga tccccagcag gcaagcctgc aagaagatgt 2640 

gccttgccta gacaacggca gaacaaacat caaggctggc agagctgtcc aggactgttc 2700 

atattaatca tgtatagata agagggaatg gcacagacag aacaattcaa cacacggggt 2760 

atgaaaggaa aggaacaagg cacacaaagg acaaagaacc tagcatacaa gaaagcctaa 2820 

gcagagagtg gcacttccca gaagggagtc ataaaataga ctgaattcat taaaacaaga 2880 

gccaaagata aacggctcaa aaaactcacg gaaaacaggt caaaataacg tcacccatct 2940 

gacagttgat actgtcaact taaccgtatc tagaactcca gcaggcacat ctccaggcat 3000 

gcccctgaag gggtctttgg actaggttaa ctgacgtggg agtgacacca tctatggacc 3060 

gaagcctcag acagaataaa aaggagccag tgagctgagc gtcagtgctc attgcttctg 3120 

gcttcctgtc tgtggctgca gcgagacacg gtgcttcctg ctttagctgc catgacagac 3180 

cacaccctca aaccgtgaac caaaataacc tcctctctac attgctttta ccaggcattt 3240 

ggtcacacca atgagaaagg ttaactaata cagcactcaa tacttaaaaa cataaacacc 3300 

aaccttgttt gcatgtgtga gactttgaag ctcacgggcc agttatgccc aatgccaggt 3360 

ctgctggcta agggtgagag tgcacaccta taatcccagc tgctgtggaa tcagcaaaag 3420 

cgctacagat ggaaggcagc cagggcagct gagactgact caaactgata gaggtgggag 3480 

gcatagagaa aaccagatta atagagtgtt ccccactatg caagaagccc tgggtttcag 3540 

gacgagagaa ctaagaatac agaagtctac tgtgtagaag cactgctagg tcacacagaa 3600 

acatcactca agtgtctctg gatgctacac ggagggcgtg tgaagtattg cttcctgatg 3660 

atctgtatct actacagcac tgctgtttta gtatgcgctc ctccactaca gctcctcacc 3720 

acaccaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 3780 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaat taatcaaaga 3840 

aaacacacac caccagttag agaaagttaa tcaggccgaa tggcggcttt cccctgtatc 3900 

caggctaccg tcaggacggc tcactgccac tggcaactct gcctgaacaa agcccgcagc 3960 

caacgtgggc ttcaggggct ctaaacactg caatcaaagg ttgtgtgtgg gggtgggggt 4020 

gctgctgcta ttcaaggatt cccaaagctt agatgtattc atcatactca caggaaagcg 4080 

tgttcaaccc atcactcatg agcagtcggt accggggtga cctattccct gtagaaatgg 414 0 

gacggatgtt ctggaaaagt tgacagaaaa gttgattcat taggcaggct ctttgcccaa 4200 

gccctgaggg taagcaaagc taactggcag gagactaggt ttgccattaa tctgagacaa 4260 

gatgaaccac ttgcccatcc tcctgacacc taaatactaa tgaaagaaca atggattgag 4320 

ctggcattat taaaaacgat agaaacagaa gtatcaatag tcatgtgttc tttctcccat 4380 

atgtcaaaac aatgtgtaag atggcatcga acacatgcag aaactgttta gggaacatgc 4440 

tgaaaatatg aagtaaaatt aaaattggaa agaaagacaa tttgcctaaa gcagctcaga 4500 

gctggagaag ggaccgaggc agagataaca gcaacgtgtg gacatacgga tctggggcag 4560 

agcagtcacg gactcagccg gaaagggtgg ggcagcctct gaaggaagtt aaggtaaata 4620 

gagccacaag gtgattggcc caggagtggt gccaccttca cctcctgcct caaagtctga 4680 

aggaatgatc ctggagtctc ccatctattg atatatgaaa ttcacagtat gttttagaac 4740 

ccactgaatg atgggtagat taactaaaag aaatttaagc ggggtggtgc aggtctttta 4800 

atcccagcac ttgggaggca gaggcaggtg gatctctgtg agttcgaggc cagcctggtt 4860 

ccaggacagc cagagataca tagagaaacc ctgactcgaa aaaacaaaat taaaagctca 4920 

tcaaaacaac aacaacaaca aaaaaaacaa aaaaacaaaa caaaacaaca aaacacccta 4980 

tagtacctgt tggtgagttt gagtgagtga gtgagtgtgt gttagagaga ggggcgggga 5040 

aagtgtgttc tggaaatggg agaaagagaa tgtgcatgtg tgtttctggg atgtagacaa 5100 

aactacatgt cttccatcaa atgcaatgtt taattatcta tgagttgaac catcttcatt 5160 

ctgctaannn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaac aaaaataaac 5280 

caaaccagta aacaaaatcc tgtaagataa agcctaagac aagacacttc ctggggctgg 5340 

ggagttgctt agaccataag gagttcataa tccaggcgtg agagcccgag ttcaggtccc 5400 

tgggcttcca agtcaggagg agaccaagga atcaacaagt ctcgactttg gtctctagtc 5460 

ccatgcacac acacgcgtgt aaatacgtag atgttcactc acacacagaa gactgcacct 5520 

ggctctctca catctcagcc aacatataaa gcctgcatta tcagaacatt ctaggttcta 5580 

gtttcagtca actcttacac agaatggcca tcatactccg tctacaactt ctcctgatct 5640 

acccacgtgt cattgcttca gtattaacaa aacccagaat aaccagctgc gtagatcctc 5700 

cctgatgccc cagtcattgt cttactgaga ctactaagtc acaaggtagc actctggatc 5760 

caaaaagcaa tatccaattg agagttacaa cctataagga ggagtttacc ttcattatag 5820 

ggcactggat tcccaatctt taatccaacg tcttcagcag atttcataac ttccaagtcc 5880 

atcaaaacaa ctactttcct acaaagacag acacaagtta gaattaagaa ctctgcagcc 5940 

tttcagatga gttactaaga agcttacttt agtagttgtc tggctaaaac tgtatccttt 6000 

accaaccttt tctcattctg gactaacttg agaagtatta attcctaagt aaatacttca 6060 

cttattcttt ccccacatct ccaatgtttt tgtctttaat ttattatagg gcaattcatt 6120 
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tcctatctag ttccctgatt aaaacagtag accttgctgc atgccattat cctcatggag 6180 

gcactgatac aatttagatt attaaataca aaaccctaaa acacaaaaag atgatttttt 6240 

tttaaaacaa gattttaaaa aaagcatgtg ctacgcttcc ttctgccact aagcctacac 6300 

atggtcctct gactgaattt ttcccctcat tctgcttcat ctaatatgtg cttttcaaac 6360 

ctggaattga accagggact tattcatgct aggcaaatgc tctaccatag agctataccc 6420 

ctccaactcc catctcaaat atcatttcca aagacatttt cttggtctct tatttagatc 6480 

aggtttcttt gtcctcctgc agctatgact tcattccttc agaacactcg tcttagcttt 6540 

aagttctgta ttaattagtg attgttttca ttctctctgc tagaatgcac tttcaataaa 6600 

ggcaggtagc cagccacagt gcttaattaa gcaacagccc aacgatgtca ttcactacat 6660 

actgggacaa gatgcctaac atcatctgca gataaagacg aactactggt gtcaggagac 6720 

agctaagggg tccagggctt gggcacgctg agtgtgagca ctggagtccg ggtgcccaga 6780 

aacgcacata aatgcaatat ggatgtggca atctacctct aattccttct ttaagacagt 6840 

ggctctccag agcaagctgg ctagcaagac aagccatatc agtgagctct gggcttgacc 6900 

aagaccctgc ctccaggtgt aactcccaag caaaaggatg atggctcaca aatctcaggc 6960 

tatcatgttc atgtacaaaa tgtcaaccgg catacacaca tgcacacaca tgaaaactgg 7020 

gagaaaataa gaagaattgc aaccaaaaaa tgtaatttga ggacacataa ttgcaggcgg 7080 

ggagtggggg gatgacagaa ggtgaactga gtggaccgag ggaaagctgt gctagcggca 7140 

atgagaagaa gggtggggca gtctgagcaa gggttcagca atcaccacgc tttactgtct 7200 

gcacagcctg gctgtagaat gctgggcttt atcacacaga attattcagt atgtgctatc 7260 

tttacagtaa agttattcta tcaggctatg ctacttcaat agaacaagcc tgaaaaagtg 7320 

gtctgctgct gagaacctga caaagatgac ctgttagaac tgtctgccaa gtgtggaatt 7380 

ccagcactgg ggaccaggag ctcgagggtc accccagatg cagggagtta gaggccagtc 7440 

ttggcaacat aacatcatgc ttcagaaatt aaaaacaaaa nnnnnnnnnn nnnnnnnnnn 7500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7560 

nnnnnnnnnn nnnnnnnnnn catgagatag ttaataaact gaagaaagcc atacaaggag 7620 

taaagtagat agttgcaagc atgaagaaag acaaaccact tgagcttttc ttttgtcgta 7680 

aggaggaaac cagacaggtc cagagagatg gctcagagat- taagagcact gactgctctt 7740 

ccgaaggtcc tgagttcaaa tcccagtaac cacatggtgg ctcacaacca tctgtacagc 7800 

tacggtgtac tcatatacat taaataaata aataaataaa' taaataaata aatcttaaaa 7860 

gaaaaaaaaa aaaaacctaa ccaatcagcc aggcgatggt gacacatgtc tttaatccca 7920 

gcacttggga ggcagagaca ggtggatttc tgagttcgag gccagcctgt tcttcagagt 7980 

gagttccagg acagccaggg tgatacagag aaaccctgtc tcaaaaaaca aacaaacaaa 8040 

caaacaaaca aacaaaaaag gaggaagcca gacaggatgc actttatacg tgaatggaat 8100 

tgacaaaaga caagttctat aagtgttagg gaaaggggga ggacaacggg ggttcatgtc 8160 

tgtggtggaa cacgtattag aaggctctgg gtatcctgtt tccgacaaac aggcactccc 8220 

aatcacacag gccactggat gtctcaggca gagaaagatg tgatagattg actttttaac 8280 

aatcacagac tgtgtggaaa atatttgtaa ggttgtcatt gtcacccagg atagagctga 8340 

tggttattca aacgaggatg ggacaacaga aatgggagag agggatgtga gaaccatttt 8400 

gaaccagggt gatttactgc gcacgtgtat agggtctaca gggagtggga tatgtagagg 8460 

aggcctatgt tcctaacttt ggtaatgagc ttattacagt tactatgcac agcctggaag 8520 

atactggaaa aggtgcaggc taggctagaa aggtactaac tgagggtttg acagcccctt 8580 

ggatgtcagg atgcagcaag cctacctctg tatgtagtca atcccttctc aggctatggg 8640 

tcctgcagat catccgtctc tgtatccatt attcccagtc catcctctga gtggctccct 8700 

cttatccagt ttaacaaaat gctgactgca agctcccaag cccagggctc tggctccttt 8760 

actccttgtt attgtacttt accctgtttg cttgggatag agtgtgccct ttataaacat 8820 

ttgtgaaagg gggaatgaag aagaataann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 8940 

nnnnnnnnag agagctcaat ggttaggagc actggatgct cttccaaagg tctccagttc 9000 

aattcccagc atcaccatgg cagctcacaa ctgtctgcaa ttccagttcc aggggattca 9060 

acactcagaa acataagtgt aggcaatcta cgtaacataa aaataaataa atgagctgga 9120 

aaagaaaaca tgtttcaaaa tatacaagta atggggctgg aggagatgtc tcaatgggta 9180 

agatcattgg ctgctctttt ggaggttctg ggttcaattc ccaccaccca catgacagct 9240 

cacaactgtc tgtaactttg gtcctgtggg agctgatgcc ctcttctggt gtgcagacat 9300 

acatgtagac aaaacacctg catacataaa ataagttttt aaaaaagtta cacatacacc 9360 

cgtgtgtaat ataacacaca ctggcttaac ttcctcagca ctgactgttc accatacgga 9420 

ttcccatgag gttttggttg cattctatca ccgaaaaaaa aaaaaaaaaa ttagaagaaa 9480 

gtatatacat ataaacctct ccctaaaata aagttttctt ttctaaaagt acatccttat 9540 

ttttttattt tttttttttt ttaagaaatg ggaacaacag ttctgctcac actgtatttc 9600 

tagcatgtaa catcttgcaa gtacttaacc gtattctata tcagctcaac acacttacta 9660 

ccgaagactc aagatcacaa aaaaaaaaaa aggacccaga ctggataatt aaacgtttct 9720 

tttgttgtag taagcgacct cttccttaga agatactaca gtaatgctga agaaatgaca 9780 

catctactgt aatctgttct ctgggattcc aacttgtttc ctctgctact cctcccttgg 9840 

cggcaatgtt cgtctgcatc cggctgagct cctcgctgcc ttgttaaacc tccttcctga 9900 

acttccgacc tgtagttccc gctctacagt gcaagcgagt ggataaggaa gcgcatacct 9960 
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gccgtctttc agggtgttga cgatgaactt 
ctgccctccc tcgaccaggg tgttcagctg 
gagagagggg ctcagtctcc aagcctttcc 
attaaataca gctgtttaaa actcctccgc 
cccatgtaaa agtctgaggc tcttcctaaa 
tttttactgt aattctagtt tctggggaaa 
gagacaacag gcaaatattc cagaaaggcc 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

ttcggtccct cgaatttggg tttcttctgg 
gcttggagca gcagaggggt caggaactcc 
gtcctggaga acagaaactt tggttgcagc 
ctgttaaata taaatacctc cattatcttc 
ttttaaggat caggcacaga gactccatgc 
cccaacttgc ctttctgcta ttttttaaat 
gtcagggggc tacgtggagg aattgtttct 
tcaggtgacc tggcttggca gcaagcacct 
ttttaatggc cccttcccct gctctgaggc 
caggctcgca ggtccaccag tgagcaccag 
gtggttataa gcatgtgcca ctgtgtcaaa 
ccagcccttg agtttactgt ttgtctgtgt 
gaggtatttt gtcaagtttg acaaaattag 
ttctgttata gcttagtctg gtcttgaact 
aggattattg agatggacag gctgtgtgac 
agtcatttga aaaacgttgg ctcatgacac 
atttaatcca aagtaattcc agcaccagag 
agagatgcat agcaggcggg gccccactcg 
gaaaagtctt aagtattggg gactaaaagc 
ggaacttagc agacttgagg ccagcttggg 
acaaattgaa aggaaaaaaa aaaaaagctg 
ctgtttccac aagaaaacag ggttactttt 
ggcatttttg tgacttgtca ttggagaaat 
tctgaagagg ctcttttaaa tttagtttaa 
tgcacacaga aggcagcttt ctagagtctt 
aaactcaggc cctggctagg caagctctta 
ttaagaattt ttataagact ctgtttttct 
atccataaag cccatcctgc aggtcattgt 
agatggttca cttatagtta attaaacatt 
acttcccaga aacacaaatg cacattcttn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnng aattcagagc tgatatgtag 
cttctattcc ttcattacat tactgttaat 
actcttggac catccaaggc agaaggaagg 
gaatgtactt cagtttgtac aggaggccct 
cctgtaggat ctcagcactg gtggggggtg 
aggcagaata gtcataaatt caaagtctgg 
ctttagctgt tcttctagag aacctggtgt 
ccgaaactct gattctaggg ggatctgaaa 
acacatggtg cacatacata catgcaaccc 
ttcaaaaaga cattcaaatt cttcctcggc 
aacaaaacaa aacaaaacaa agaatcatta 
acggacatgc . ctggtaccca gggagattaa 
taggtatgat ctaccacttg ggtgctggga 
ttttaacccc taagctgtct ctcccaacgc 
cactggcaag catagcatgg taatctgact 
gattattaga agtaacgtac tagatttaat 
aagaataaaa ataaaaaccc tgccagccat 
gcaggtgatc tctgtgtttt gcaaggccag 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
agaaattcaa agaaacaaat gccaaataaa 
taaaaaactc aaggctttaa atggtaagat 
caaaatatat caatttaatg caattatgta 
cttgatagca ctcttataat tcacatagaa 



gtggacctgg cagacacagt tgctggccag 10020 

cgtggccagc atgaacgctg caaaagcaga 10080 

ttaacccgaa agctcatcac aaggagaacc 10140 

cctgcagaga ggaaagcagc atcaatccgc 10200 

tggtatctgt ttctcacagt ctccaaatca 10260 

gacctttctc ggtctttagc cccgtgacta 10320 

cccattttct ttttaaagct tctannnnnn 10380 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440 

nnnngcacat cttgtgaagt gtccacatct 10500 

gacgtggtag catgtgactg tcactccagt 10560 

aggctggcat tagctgcaga gctggagcag 10620 

attaatgaac tagaagaatt tttttgtctt 10680 

tcataaacag tgttgccttt ttatttaagt 10740 

cagactacca ctcaaccact gagctacacc 10800 

tgtatcagtg gccaccaaac atggggagag 10860 

ctcctaccaa gtgggcccca ggtttcaaat 10920 

ttacccctaa gccatctcat tggcttcatc 10980 

aggctctccc tatatagccc tggctggcct 11040 

gtttctgctt gtccttacct ccccagcact 11100 

ctcagtcact aagctttgcc aagccatagc 11160 

ggtgatttgt caaaccactt ttgttccact 11220 

ttgagtatgt aggtcttttt ttctggaatc 11280 

catgatcttg cctcaacctc acgattattg 11340 

catgctcggc tgtgtgtttt agcatgcatt 11400 

tttacaggtc ttccatgttt gatatgtttt 11460 

gctgagacag gaggatctca aggtcaacct 11520 

gttaggttaa tatcatcact gacttcagga 11580 

aggaggatct gaagttcaag gtcatcttta 11640 

cgctgtggga ccctgttttt aaaccagaaa 11700 

gaggaagtga atgtgagtgt tcacatagtc 11760 

ggcaacaaat aggtgctttc tttgaaggct 11820 

gatttaatta agacttttct actgagtgcc 11880 

ttttatctca ttgttagtgt ggtgtgcttg 11940 

ttcactctct cctccacagc tcctggagtc 12000 

ggacagtgtt agctgtagct tattaagttt 12060 

ttctcaggtc atgatacagc aggaaaatac 12120 

aagtaccggc atgtgtgttt agcataatga 12180 

ggattggatg gaagacatgt agttttggtt 12240 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 12360 

tactaactcc tactcaatga atcctttgtt 12420 

agtggtaact atgtaccaaa gagtcaaata 12480 

ctggcaaaaa tgtatgatga tctgggatgg 12540 

tggttcattc catttctggc aatgcataga 12600 

gggggtgagg gtgaaggggc gggaggttaa 12660 

gtcctggaaa gaggactaaa cgattaagag 12720 

gatccccagc acatggtgcc tcacgactgt 12780 

accctcttct gccctctgta gatacagaac 12840 

aaacaaccca tatacataaa atattttttt 12900 

tatatagtgt ttaccaaacc tcaaaaacaa 12960 

atgttttgcc ttcatgtatg tctgcccacc 13020 

aagaagacat tagctcccct ggaatggaga 13080 

acctgggtcc cctgcaaaag cagtaaatct 13140 

ctaaagattc ttgtaacaca gcatgatgag 13200 

tcagggcgcc agattttgag cttaatgctt 13260 

gcctggagct tcaagcaaca aaattaactg 13320 

gatggtaatc ccagaacttg agaggcagag 13380 

ccacaatcta catagcacgt tgcagtannn 13440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13500 

nnnnnnncga ataaatctac acatgtaaaa 13560 

tacacatatt gtaataaaga gataattgtc 13620 

atcatattct tggatgaaaa gatctaatgt 13680 

tattcaggag atctctggtt ggcttttgaa 13740 

gaaaaaaaac catgaaaact gccaaacatt 13800 
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attagaatac tccacagatg gtattttggc agcacataca tcgaagggct gtgaaagatg 13860 

tgtagatcat ccacgccttg ctagggagag ggcgggtgtg tgtggggggt atagctgttt 13920 

gggaaaataa cctggtaatt cctcattagt taaatcatag tcagaacctg gactagcaac 13980 

ttctctctaa aatacattca ccctcagcat ctgcattgcc aggaaaccac tcctagcagg 14040 

atctgtacgt ggatcaaggt agtagcatct gcatttaatt gacattctcc taaatgcttt 14100 

aaattatctc tagattactt atagtagcca agatgatgca aattatgtta cactgtatta 14160 

tctggggcgt aacaagaaaa tgtctctact caggttcatt caggtgcagt acttcccctg 14220 

aatacttctg aatacacgga tcaagaagcc acagaaagag ggctaaccat atacaagcat 14280 

atagtacact aataaccatg tacaaccata tagtacacta atattcagtg cattactcaa 14340 

aatgcaaaca gatggaaaca atccaacagc ctgtaagctg aaaaacaaga taagcaaaat 14400 

gtgctgggcc tagaggccca ggtctataat tccaactaag gtcgaggcag gaggatctca 14460 

agttcaaggc cagcctagac aacttagcaa gaccttgtct caaaacaaaa agtaaagagg 14520 

ctgaggatat agctcagtat agagcatctg cttagcatgt gcactgacag ccgtatcaca 14580 

gaggaaaaaa aaaaataagc aaaatgtgat ctgtctgcac aacaggatat cacagccccc 14640 

taccacaggg gaacgacaca gtaacacaac aaaaacttag ccctgaaaat actatggtaa 14700 

ataaagaagt gtcactgagg atcaggaaat gcatgactcc atttacatta tatagaaatg 14760 

agaagatcag tgagcctcta ggactcaaga gatttgggat tggcagctaa agggtactgg 14820 

gtttctttat gggggtaaga aaacattcta aacttaactg tgagaatgac tactcaacaa 14880 

tgtcaagtgt tcaaaaatca tacttttttt tttttttggt ttttcaagac agggtttctc 14940 

tgtgcagtcc tggaactcac tctgtagacc aggctggcct cgaattcaga gattcacctg 15000 

cctctgcctc ccaagtgctg ggattacagg catgcgccac cattgtccgg ctcaaaatca 15060 

tacttttaaa aattgcccag tgactcatga atacaatcag aggcgggaga ggacagtggc 15120 

aaactcagga taccagtgtc ttttatgtct gctgcccaac tatcaatttc ccatagttac 15180 

cagagaactt tttggtttgt ttcatcttat ttgttgcttt tggtagaatc tcaatatagt 15240 

aagatacaag gctggcctca tactatatag ctgaggacga ctttgaactt ctaatcctcc 15300 

tgcttccatc tcccaagtgg tgggattaca ggggtgtacc gctatgccca gcaagcacaa 15360 

agccatttga accacacccc agccttttca gagaaacctg tacaagcctt agtgccttag 15420 

catattaagg caacaaaaga cataatgcgt ggctaccata gagtgtttgc ctaccatgtg 15480 

tgaggctcta ggctaaatgt ccagcactta taaaaaagag ttaaaaacac tcatgactca 15540 

aggatgacta tgcagtcttg tgtacaaagc cccgcattca atccccagca ccgtgcacat 15600 

caggcaggct ctgtagagga cccagcttaa ggtcatcctt aggtaagtta gaggccttag 15660 

atggctacat tagatgagac cctttctcat aaacagaata aataatttaa agctcctgat 15720 

caaacactat gccttcccat cacactcaga ataaagcact ctactggccc tttaaggact 15780 

gcccatctgg aagagaaacc taagttacat tccttgcttg tgtcatatgt gataacaaac 15840 

tcactggaaa tacgaaaata cagtcttaag cttggtcaga aagcttcccc agcaacatga 15900 

tntcagagga cataatgcag aaagtggaca aatgcaaann nnnnnnnnnn nnnnnnnnnn 15960 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 16020 

nnnnnnnnnn nnnnnnnnaa tcagaggaca tctttcagga gttagttctt tcctccctct 16080 

atagctttca gggatcaaac tcaagtgtgt actgagcgct tatgcccagt gcgccatcgc 16140 

accaggcctg cttctttgtt ttttatgggt ctgaatcaat tagcaccatt acaacaatgt 16200 

tgacaatcag caagtacctt tctctacctg gctagtaaga gaagtaagtg cctttggtgt 16260 

gtgaacgcag tttctcttgt gaagtgcatg gacttgatct ttgctcacaa cgttttttag 16320 

gtccttaagt tgcttgggtt ttatgggaaa ggctcttggg ttttttgaaa agattttact 16380 

acaacttgat ataatcatta tttttaatcc tttaaatagt atgacttatt ttaacagatt 16440 

aatattgaac tgttctttca ttcttacata ataaatcctg ccttaaaaaa taatcctctt 16500 

agcttccttt ctctattttc aaatttgttt tatatttttg catgattttg aacatttata 16560 

aaagtaggca gacaacacag tagaaccaag tccccatata gctgtgcaca tagcttcaga 16620 

ttattgcctg ataataggtc ctgttttgtc tctgttttct cacagggatg tgttattgtg 16680 

tgtgtgtaca catacataca tatatgtatg tatgtatgta tgtatgtatg taatgaactc 16740 

cctttaacaa aacaagtact gggctgggaa gacagcacag ttagttatgt gtttaaccgc 16800 

acaagcatga caaccagagt tgagatcccc accaaccgca taaaaagctg ggcatagtgg 16860 

cattgacctg tagccctggt gctggatgaa agctggggag gcaggtagat cggcagagct 16920 

tactggcaac aaatctgccc agtaggtaag ctctgggctc agacatccta tataggaaaa 16980 

agatgaaggg cgaggcgcag cggcacacac ctttcgtggt agtgcttgag aggcaggggc 17040 

aggccagtct ctgtgaccag cagcctggcc tacatgtcaa gttgcaggac agccagagcc 17100 

accacctact gagactgtct cagaaataag ttttttaaaa aattgagatg aaggagctgg 17160 

taagatggct tagaaggtaa aggcacttat cactaagcct gaagccccga gtttgaccct 17220 

ggaccccaca ctgtagaacc aactcctcca agttcttctc agacctccag cagagcacaa 17280 

gtgtatgcag acacacacac taagtaagtg aatgtaaaaa acatgacgta gtggcactgg 17340 

cctttaaacc cagcattggg aggaagaagc gggtggatct cttgagtttg agaccagctt 17400 

agcctacata aggaatttca aggcagccag ggctacctag aaagtagctg tttatgaatg 17460 

aatgaataga aggaaggaag aaagagagac agacttaaaa aatatatgct ggagagtaac 17520 

agaagaggac accggcttgc tggtgtcttg acctctggct tgtacacata cacatgtgta 17580 

gtgcatacac ccacatacaa ttgtactcag acacacacaa acatgtactc attcatatac 17640 



73 



WO 02/29059 



PCT/US01/31488 



tgcacacctc aacactcaga aaatgaaaaa acaggtacca tttacacctc cgtgttcggt 17700 

ttccaaccac tcatatgtat gggttgtaaa tgcttatatc tgtatgtgtc tgtatatttg 17760 

tgtatacatt caaagttgag tcaggatcca acgtaaactt ggatagtagt gggttgatgg 17820 

tctggaagcc tgctcgcagc tgtctttttc tcctcgtacc ttttcccctg tttgtttcta 17880 

cgacagcagg tcatttgtct ctaagtgtta gtttcccatc ctctctcttt tgctgatggt 17940 

agccttgtag tagtcacctg tgttctctgt aaaatggctt tgccgtgtta tttcaatatg 18000 

ctatcatcct catcttgcta tatttcattc aatatatgta tatattacaa gatagattaa 18060 

aattatttta attttatgct tatgaatgtt ttgcctaagt atattgcacc ttgtgtgtct 18120 

agtgtccaca gaactcagaa gaaagtgtca catattctgg aactggaatt gcaggtggtt 18180 

gtaagccacc atgtgggacc tggaaaccaa atccaggcgc cannnnnnnn nnnnnnnnnn 18240 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18300 

nnnnnnnnnn nnnnnnnnnn nncctttggt caaagatcct cagtttcgac tttgattacc 18360 

cagacttcct gtttctctca tggaacagtt tccccctgag atttactagt ggaagaaagg 18420 

cactcaaaaa gcagggagcc ctcgtacaaa tgagacttcc tagctatata attaggccga 18480 

gatgcacaca tccacagtca ttacccttct tcagagcctt tgtcatgtca agtgtatttc 18540 

gcccatgtga actttagaac tggcttgttg tgtttcataa aaagtgtagt tgggctcttg 18600 

attgggattg tgttaaattt atagattcat ttagggagag ctgacaactt tacagtatta 18660 

aatgttttca tccgcggaaa aggttgtctt gccacttact tgggctttct tttatgccct 18720 

taagtaaagg tttatagttt tctttatatc agtcttgcac atttcctgtt agatttattt 18780 

ttgcttgtaa ggtttccttg gttggtattg tgtataaaca ttttctccca attacatacc 18840 

ataattgatg gttaggagta taaataaaag gtagaatttt aaaattgctg tatgaatgac 18900 

tgctttgcta taattcaaca tcttttcatt tttatgccaa tctacttgac atgtttaggt 18960 

taaatgatga tatctgtaca gagtaatcct ctgatccagt atttgcacat cttactttct 19020 

aacgtccata gcatagatac acatcttata ctgttgagta catatatatt taaggtattt 19080 

accatagtct tataatatgc agcgtgcttt ggttcaagac agttgccctg tgttcctcaa 19140 

cattaacatt tttttcatca caaatacaca ttgaccttta tcaaattttt aaaactatct 19200 

tgagagaaat gaccattttt cttaatctgt taatgtaaaa- tttttataaa aatagttata 19260 

aataatatta gcctacatat ttcttctgtt ctctttttca actcttagaa tcagagtatg 19320 

gtagtctcag actaaaccag gagcttccta tctgtttctc tgttcttaag tcacttatat 19380 

aatgtaagga tgctgtgtat atctgccagc taggccttat atacaaaagg cacccatcac 19440 

aaccttctaa aacagtctta ccacttagag accatgttca aacatatggg cctttgaggt 19500 

aattgccaca ttcaagctat aatattgtta tctaagggaa tatcttcact tctagcagat 19560 

gcctaaaaat atctaaaggt aaacactggt aattgctgtg tttgttgatg ctgctcttcc 19620 

tcctcctcct cctcctcttc ctcctcttct tcctcctcct cttcttcctc ctcctgcttc 19680 

tccttttctt catcctcctt tcttttctta tttttgaggc atgatttcac catgtagccc 19740 

taggtaaccc gtaacttact atgtatgtag accaggctag cctctgtctc ctgagtgctc 19800 

atattaaagg tgtgtatcac catatccagc aacacttgct ttgagatggt tagaggaaaa 19860 

aaaaatatac gtaaataaag atggatgcca attactaaat tgttacttcc agtcaaactt 19920 

tgtacctagt ctaaggccaa aatagggatt ttttttctac tttgcaagtt ggctccatta 19980 

agaggctttt cttctcttgg tctcactaga taggaaggag agagaggagg gaaggagaga 20040 

aagcggttga ggagtgggag gtagtgtgac cgagaatacc cagtaggctc atatatttaa 20100 

atatttggtc cctagttgat agaactgttt agaaagatta ggaagcatgt cttaggggct 20160 

ttgaggtttc aaaatttaat gctagaccca gtctttcaag ggagggggcg gtctgtctct 20220 

ctctgcctgc tgcatgcaga gctctcagct actactctag tgtcaagcct gtgtgcttcc 20280 

tgcctcaatg atcataaatt aactgtaagc aagcctccaa ttaaatgctt tcttttatag 20340 

ttaccgtgat catggtgtct cttcacagaa atagtaacct gtggtgattt taatatgcct 20400 

ggaccaggga gtggcacttt taggaggaat ggccttgtta agaggaagtg tgtctctgtg 20460 

ggggtgggca atgagaccct cgtcctaacc atgtgagaac cactcttctc ctattggcct 20520 

tcagatgaag atgtagaact ctcagatcca cctgcaccat gtctgcctgg aagctgcctt 20580 

tgttcccacc ttgctgcccc aattaaatgt tgtacttata agaattgttt ttggggggct 20640 

ggagagatgg ctcagcagtt aagagtactg actgctcttc cagaggtcct gagttcaatt 20700 

cccagcaacc acatggtggc tcacaaccat ctgtaatggg atctgatgcc accttctggt 20760 

gtgtcagaag acaggacagt atacccacat acattaaata aataaataaa taaataaata 20820 

aattcttttt aaaaaagaat tgctttggtc atggtgtctg ttcacagcag taaaacccta 20880 

acataaccct gactaagaca acaagtgagg aaaggtgttg tgtgacactc tggatctctg 20940 

gaagctcacc tcagcatgaa gcttgtcgaa gcgnnnnnnn nnnnnnnnnn nnnnnnnnnn 21000 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 21060 

nnnnnnnnnn nnnatctaag tacactgtac tgtcttcaga cacaccagaa gagggtgtca 21120 

gatctcatga cagaggttgt gaactcagac ctttggaaga gcaatcagtg ctcttaactg 21180 

ctgagcatct ctccagccca aaataattct tactagtaac atggaacaat caagttttat 21240 

tatatgatac atattaatca acttataagt acatgattat gcacatttat catatcgtgc 21300 

aaccatcact gctgtcgttt tgttttgttt tgttcttttg aggcccggtt tctgtgttgt 21360 

tctggaactc actctgtaga ccaggctggt cttgaactca atgatctgcc tgcctctgcc 21420 

tcccaagtgc tgaaaacaaa tgtgtgcacc accacctctg gctatcactg ctgtcttttt 21480 
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ttttttttta acagttattt atttcgtgca tgcatgtgtg tataagcatg taacgtatgc 21540 

catggtatgc atgtggaggt cagaggacaa ctttcaggag ttagttcttt cctcccactg 21600 

tgggttctag gaaccaagct caggttgtta gacttgcatg gcaagtgcct ttaccacaga 21660 

gccatcctgc tggccctact ataggtcctt atataaaaag atcatatgcc gggcaaaaac 21720 

caaacaaaaa ataaacctca aaaaacaaaa ggaccatata atattgtggg ggagtggatg 21780 

aagtcctgaa cgaatgtgtt ctgttgacat gtctgtactt cagacccatg ggaattggca 21840 

aagccttcct ctggtcctgt gaggatgctg atagtctgtc taaaaactag agatcacagc 21900 

tttctcctct ggatgactgt aaccccagat tgttcctctt cagagactgt ccaccaagct 21960 

accctgccta cttaagctgt acacaatgaa tgagctgagt ttccaggtta cagcacagta 22020 

gacactgtcc atcagtgaga gcacagccta gcctaacagt acacatgtct gctttcttca 22080 

cgtttccaga accaagcctt gctggataga gcatatttgt ctgtttggct tatttcactt 22140 

gataaaaagt tttcaaggag ggccaggtgt ggtggcacac gcctttagtc ccagcactcg 22200 

ggaggcagag gcaggcaaat ttctgagttc gatgccagcc tggtctacaa agtgagttcc 22260 

aggacagcca gggctataca gagaaaccct gtctcaaaaa accaaaaaaa accaaaacaa 22320 

aacaaacaaa caaacaaaca aaaagccaaa aatccaaccc cccccaaaaa aaaaaccaaa 22380 

ccaaaaacca aaaaacaaca acaacaaaaa gtttttgagg tttaatttat tgcatgtcac 22440 

agaatttcac tgtttaaaaa aatggctgaa taatatttca ctatccattc acgtatttgt 22500 

aggcattcat gtgtgtagtg gtttaaataa aaatagcccc cataggcttc tacagttgaa 22560 

tgcttagtca ttgagtagca gtactagaga gggaattgaa ggtgtggcct tattggagta 22620 

ggagtggcct tgttgcagga attgtgtcac tttgaggtcc cagcaacaag gttgctctga 22680 

tcacatccaa agacattcta ggtctatgtg atctggctgg aattcagaca tgcccttaat 22740 

acacaccttt aatcccaaac aatgaaggta aagttagttt ataaaaagaa gcacccatgt 22800 

ttgaaagtga cgtttaatta agagtgatga attagagaaa gatctgctgt cacagagcag 22860 

agaggaaaga gaggcagcat aagagggagc atggcagagg gagagggagg aggggttttc 22920 

accagggcat ttgtacagag acaggttgca gagctagaac aggtgaagac agaacaagcc 22980 

agagaatgag aaggagccag gagattagga cagattgcca atgttaatag gctaagcaga 23040 

gcattttagt cagaaactga gagaagtcaa attgaatcag- ttagcttgga aaggagtttg 23100 

agcagcaaca gctgagttaa actagccaac agaatccaga aagaactaga aaagatgagc 23160 

ttactcagca gcaaatctca gaggctaaaa acatcttaga' cctagattag actgcatgga 23220 

ggctagacgc ttccagggct aggcctaggt tagcagacgg agagagtaat aagccttgga 23280 

gacaacagtt aatacagaag actatgtaca gacatggata tgaacctctc agccacttct 23340 

ccagcgtcat gcctgtctgc attgttagga gtcatctagg aaaggctaag ggcaggcaag 23400 

caacttttcc agagatggtc cactgttttt tgcatggctt ttgagaggcg agctctgaga 23460 

gggaaggttc caagagactt catcccagga ttgctgctta attacgacat gccttttctt 23520 

gtcactgtta tttagtataa tgactcctga gctttagccc atcctattgg gcatatttcc 23580 

tgcagatcaa cataaagatg aactttcaca aattaatgct gtttagatga ataaatgatt 23640 

ttataaaatt cctgatttga tttaaataat tttaggaaga aagctttagg agatagttta 23700 

gttggtttgc cagaaagatg taataacgtc agaatcaaga atagaatgtg gctgggcagt 23760 

ggtggcagat gcctttaatc ctagcacttc ggaggcagag ataggcggat ttctgagttc 23820 

gaggacagcc tggtctacag agtgagttcc aggacagcca gggctacaca gagaaaccct 23880 

gtcttgaaaa acaaaacaaa aagaaaagta agtaaaggct gcataataaa gaatacaatg 23940 

agctttcaca actacaccaa aaagagacat gcttgggaca aatttgtgat caaggaaaaa 24000 

tattcattct agatcaggtc caaggatgaa gccacaagtg tgtgatatga tgaacaagac 24060 

catggataaa ctgttgtttt gagcttaaag aataaaacac tgctttgaaa ttaactatca 24120 

acattctact gtaactttcc tttttataaa ttttatctat gagataattt tctaaagaac 24180 

ttgtgtctat aaaggtatag aaggacagag agaaagaaat aaggtgtggc atctgggctc 24240 

tgctccatcc acccaaataa atatgtgtgt gtgtgtatgt atgtatgtat gtttatctat 24300 

atgtatgtat atacatacat gtgtaggtag gtatatgtgt atgtatataa gtatgcatga 24360 

acacttggga agttgatgag acaagtgaga ggttgggccc ccnnnnnnnn nnnnnnnnnn 24420 

nrmnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 24480 

nnnnnnnnnn nnnnnnnnnn nngaattcac tctgtaaacc atgctggcct tgaactcaga 24540 

gaaccgtgtg cctctgcctc taaagtgctg ggattaaagc atgtaccacc acaacccagc 24600 

tagtttaaat gtttcttatt tttttgttta tgggtctttt acctgtatgt atgtgtgtgc 24660 

accatgtgga tgcatggtgc ccttagagtc cagaagaggg tatcagatcc cctggaactg 24720 

gagtgacaga gggttgtgag ctgggacttg aacctaggac ttctaaaaga gcagcaggtg 24780 

ctcttaatag ctgagcctta tctccaggcc gtcccatgga tttggggggc tttgtttcat 24840 

tttattttgt tttgagacag ggtgtgtagc tcatgcttga atttactatg aagccctgac 24900 

tcccctcaaa gtaaagatcc tcctgcctct gtctacagct gctaggattc gaggtcttgt 24960 

accacatgct cagcacagcc atgattcata acaataaaaa aagaaagaga gacctaaatg 25020 

gccttagaga taaataaatt attttttttt taaagattta tttatttatt tattacatgt 25080 

aatgcacact gtagctgtct tcagaccccc cagaagaggg agtcagatct cattacagat 25140 

ggttgtgagc caccatgtgg ttgctgggat ttgaacttcg gaccttcgga agagcagtcg 25200 

ggtgctctta cccactgagc catctcacca gccccgagat aaataaatta taatgtatgc 25260 

gtaaggtggg atcatctcag tctccgggaa tcttgcctgt tactccttcg ctctcccttc 25320 
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tattcatgct tgggtaactg gccctggctg 
gtggcaggga ccactgcgcc cacagggctc 
ctgggtggaa gtagagggct gcatatataa 
aattttattt aaattttata cagctctata 
ggacctgcca acggttgtca agggtaatgg 
tttacttcca tactttctac attaccccac 
aaatagtgac agatgacttt gaaggaaaat 
cccgggaagc ccaggctaaa tgtaatctgg 
tggtcagctg agggaccctg gtgaccctct 
gctgagtacc caaagtgctc ctctgggctc 
caaaggatac aggcatgagg actgaggcct 
aggaggcaga gctccaagag tccaaccaag 
gatgggatta gaaaggcatg tttgcaaata 
aaccccaccc ctgggtgtcc cttccctgcc 
gagaaagtcc caagaccagg ctggctggag 
gccctcgaag actagcttgc tcgcccacac 
cctcccaccc tcgtctcttc ctccaacctc 
tagaacttgc cagtgctcac tgtaatgtcg 
gggagcaggt atgcctcaga tctccctcta 
tctctgacag ttcctctcct cctgaatcct 
ttccatctca aaaaacaaac actgggaagg 
tgccccagct tggctgggaa ggggtaactt 
cagttgtcaa ttaattgtaa atttcaaaat 
atcccaacac aaagaaatga taaacattca 
gatcaccgca cattgtatac atgtttttga 
aattaccgtg cagtgattca agataaaaac 
aatagctgcc cttgaccccc ccacccccac 
tgtcagagct tcctgtggga gggtggtttt 
tccccataag aggagtcacc cctgtcccac 
aaccagtaag atgtgagcag ggatgatacc 
caccaaacca aaaacaaaca aacaaacaaa 
gtcttttccc acataaagtt aagcacacaa 
ccccacccct ccccgagaca atgtttctct 
ttttgtagac caggctggcc tggaactcac 
cccaaatgca gggattaaaa gcatgagcca 
tctagctgca tacgtatcaa aagatgacct 
tggacacgca aactgtatat gcctcagtac 
tgggtgggta gggaagtggg ggggagggta 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnggcagga tcctgtgttc 
gtttggttct gagcacctga aggggaccaa 
ctggccccac tgcccactcc caccctctgg 
cactactctg ctcctctccc agatqcacgg 
gctaacaagc tgggcagtgt cacctccctc 
tctcccacag tgagacactc atgtccactg 
taaaataact agggggactt gtcaatcact 
gagggtggtg acagctcagg gacctgcatt 
gaactgcccc tggacaacct ctgcagcaac 
gacaggtgtg accccagtac ctagtgccag 
tgacaggaca aggttggagc tggtagatgt 
aaaagccatc atggcctgag ccagcctgcg 
gtgtcctggg gacctcttga gggtagcatg 
agtcagtgag gtcaagggaa cagctctagg 
tttttttttt tttttaagat ttatttattt 
agacagctcc agaagagggc atcagatttc 
tgctgggatt tgaactcagg accttcggaa 
tctctccagc cccctctcag tcctgatgcg 
gaggagagtc atgctgaagt ccttcctacc 
ccataactac tgactccccc acccccattc 
ccctctggcc ccacccattc ttccccagcc 
aagagggcag gcaggggcag cttctgctgg 
aaaggagatg aagaaaaggc ccttcccatg 
ccctttggct cttcccccta aacctctcac 
cggcgctcac cggagagaac tgttgagaaa 



attgatgaga gctgatttcc ccactgccct 25380 

cctcaggatc ctcagtacag agctgcacag 25440 

cacgatctca actttatttc tttaaataaa 25500 

taaacgaagg aactattgaa ggttcagcaa 25560 

cgatgtagtg attttttttc ccccttccat 25620 

aactggcaag tattatttta aaatgaaagt 25680 

tgaatcggta aaaagaaagc tgagagacca 25740 

gtcaggcctc ccaggcctgg ggtctcaaga 25800 

tgggccagca gggacgggga ggagccggaa 25860 

caagggcctg cacagagact gtgtgggaat 25920 

gacgaaccca gctatcattc gtcctagaac 25980 

aggcaggaag ctttgggacc cgagatgggc 26040 

ctttcaaatt tacgatgcac actcactgga 26100 

tcttgccaca cccaatagct gacatcactg 26160 

ctcctgatag gttccaccct cctgcagagg 26220 

cgccagatgt ctgtgtcttt ctctcttttg 26280 

agtggagggt cccctgcttc ctggggaaag 26340 

tccctgtagg tgtcatggtc ccccattact 26400 

ttcgctgccc tttcaggctg tctcagtttc 26460 

gcttgttggc atgcgaacag gctcaatatc 26520 

tgttgagaga cagagagcat gggtaatggg 26580 

acaatgctct actgcccagt agggtagctg 26640 

agctagtaga gaggatttta gatgttccca 26700 

aggcgatggg tatgctaatt gctctgatct 26760 

aatgtcaggc tgtaccccat aaatatgtac 26820 

tataatttta aaaagctaaa aacagaagga 26B8Q 

aaggtccttc- ctgtttgtcc agccacttaa 26940 

ggtgtacaca gacactcctt cctccctcct 27000 

gatgccatgc' agggccacat gcgtgatatt 27060 

tgtctcttat aacaaacgga aaaaaaacca 27120 

caaacaaaaa cagggttggt ctgtccctgt 27180 

agtagccacc atttatttat ttgtcccctc 27240 

gtataacagc cctagctgtc ttggaactca 27300 

agagacacag agattcacct gcctctgcct 27360 

cgaactaacc agtaccccag agctcttgac 27420 

agttggccat cactggaaag agaggcccat 27480 

aggggaacgc cagggccaaa aaaatgggaa 27540 

tggnnnnnnn nnnnnnnnnn nnnnnnnnnn 27600 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 27660 

atgtgcaaca ctcgatgcaa gctgtgtagt 27720 

gcaggctgat gcccaggcca cgggttcttt 27780 

catccccatg atgaacatgg ccacagatca 27840 

agccataggg tccccagatt catctctgca 27900 

aaggttcctt tcctgctctg agcagcagtg 27960 

gaagatattg tagccattaa attcctgtgc 28020 

acactcttag ccccggactt ctgactcata 28080 

ctaccaaata gccatgtgtc cctgatggag 28140 

tgaaccctct gtggtctcct agttcttctg 28200 

gtgagagagt gctagggcca cactaagggg 28260 

ttgggccacc aaagagaaca ggtcagtagt 28320 

agtctcctct gcagttggga cactcttgca 28380 

gtcaccaaaa tcctacaagg acagatcaga 28440 

ttctctgtgt ccctcacgga cctttttttt 28500 

attatatgta agtacactgt agctgtcttc 28560 

gttacggatg gttgtgagcc accatgtggt 28620 

gagcagtcgg tgctcttaac cactgagcca 28680 

acagggcagc aaaggccttg tcccagatct 28740 

ccaccccttc cgaacccctg aacatcagcc 28800 

ccttgcttcc actgatccgg tcctcctctt 28860 

ccacctgatt gtacctggtt gtccaacttg 28920 

gcctgctcac tcactggctg tagaaatgag 28980 

ggtccccatc ttgccaagac ataggtgagt 29040 

ttttgagtac ctgctggccc gggagatcca 29100 

agggagaaca gagaactcag cgttcctccc 29160 
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tctccaccct tctggcctct cccagatttg 
actgaccact tcccactcag acctcagctc 
caggacaagc tgagtacgag gagcccccgg 
ggagtgttag ttcctacctc acgctcgaag 
cttgaccgca ccaggctgcg gaacctggac 
ctctgagagg tgggatttcc ataagctttt 
tttcgtctta cactgtggat cagtcctggg 
gaccttgcgc atctcccttc atgcctttgg 
agttcttcct gctcccatgt agatgttgag 
gcatgatgcc tggcccccac cctagcaaag 
ctttctcagg ctgccctggt cctacccaaa 
cagccagaaa cttctttgtt gaccaatgaa 
gaactacaag cctcatccca cttctgctcc 
ccagctggcg gaagggggga ggctctcagc 
cctacccagg ccttatttca ggcaccagct 
cctccacaac tgtgttcaag aaccttcagc 
tttttctcgc caagagtagg cctccaggtt 
cctggacctc tgctcctcac caccccagga 
ttcatccaga ccccttgcca aggtgccagc 
gctgcaccac ccctccatcc tacctcttca 
gtgttctcct ggaccagctt gctccccacg 
ctctcctccc acaggacctt tgctctcagc 
gtaaatattg acctgtacat ccggttaaac 
atcaaggcta cccagaaccc tgcctgagcc 
gaggaggttg ggcctggcaa gccttctaat 
cagtgaggct cagagaggct gtgtggcctg 
cagccaaagt cccaccacca ccaccaccac 
caccaccacc accaccacca ccaccaccac 
caccaccacc accaccacca ccaccaccac 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
gaacatctct ccttttttcc tttttgagac 
gtgtgatcct tctgcctctg cctcccaagt 
tgcctacaca atgtcttagc agcccttagc 
ccctttagct ggcttctctt gtgaagggct 
ggctcttagc cccagagcct tccttccttc 
ctcccatttc cctctatctc cctaagcaac 
cttggaccac aagacaaaac tgcagcctgg 
ccccctggag tcaggtctag gggaggaaga 
ctgtcctagt gtgccctcac ccatcctata 
ccatcaccat cctctcatgt acacgtgccc 
ttctgacttc ccagaaaaat atctgatctg 
ggtccttaaa attaagtcag acaatccatg 
tcgttccttg ttcactctgc ttttggcagc 
taatccctca tctatagcag gtctcccggt 
caggggcact gccactcagc tgctcaggga 
gcagatttag gcttcttgtt tcccttccca 
tggctgagcc tcaggaggct gatttatcag 
tctgggtagg tagcaactga gacaggagga 
agtaaatgag atacccttgc agatgggacc 
ggtggtggga gtctaagtca cagatctttg 
tgaggtggga aatctgatgt gcatggggaa 
caagcaggga caggtgctag ggggatgtct 
ggaaatggag actctaaagg gcagaaagtg 
atgcaaggcc ctggctgcaa taactgggtt 
ccaggacagt gatgccatcc tgtccaaagg 
gggggaggag ggtccgggga gaggggtcaa 
tcccatgtgc cccaaatcag gaggcacaga 
tgagaagaca agtgggagcc agtacatgtg 
tgttgctcct cctcctccca gagccaagct 
gacagtgcag gggaggacac ctctctctcc 
ggctggctgg ctcatggcta cgccgctcac 
gaaatcttgg aatctgaatc agcctgagat 
gaaaggtcag ccctgcagaa cagaactctt 



cccccgcccc ccagcatctc cttcagcctg 29220 

tgcctcaccg tgaaacaggg accttgcagg 29280 

agcagtgcca tgttcctgta tccagaacag 29340 

gccaagcagt agactgctat ccatgggttc 29400 

tcaaaacata gcagctgtgg acctcactca 29460 

tttttcacct gtacatttag tcttcattct 29520 

ttcaaattta aagccctcat cttgcaagag 29580 

ctttaccctg tcttggtaat tcatggcaga 29640 

gacccaaata agaatctctg taaatactga 29700 

ccacctgacc tgttgttcat ttcatccagc 29760 

ggctctgaga gctaatctgg gctggcaggg 29820 

tgactggccc agacaccttt ggacttacgg 29880 

aagttctgat ccagggtgct tcggggaagc 29940 

ctagagagcc ttcctttcca tcctcagccc 30000 

cttctaaaag gtccttctgt tatccctaga 30060 

cagggcctca tctccaatct ggatatatga 30120 

ttggagttct agaggtttct cctggagctg 30180 

cgctgtgaag ctgcaggctc cctgaataaa 30240 

tgtctacttc ctctgctgcc caagcagcag 30300 

ggcttcttag cgcagcacac gcagcacacg 30360 

ctcccccagt gcagccagca gggcctagct 30420 

caacccccgt tcagcttgtg ttcagtgctg 30480 

attgatatgg gggccagaag accctttccc 30540 

tggagaaggg gtttacagga gcagataagt 30600 

gatccctcaa cataggggat tatccacagt 30660 

tgtaagggcg cagagtgggc tccagagtca 30720 

caccaccacc ■ accaccacca ctaccaccac 30780 

caccaccact accaccacca ccaccaccac 30840 

cacctcatct ' acccatacta anttgaggct 30900 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 30960 

nnnnnnnnnn tctatgaggg ttacatttta 31020 

aatcttacta tatttaggct ccccttgaac 31080 

gctgaggtta caggcatgca cagtcacatc 31140 

agcaccaggg gtcaggaagc cctcaactgt 31200 

atgtcttctt ccccttccta gcatggagac 31260 

aggttaaaca gcaccagttt ttggtgggac 31320 

gaccttttct gctctgactc tcatctggca 31380 

gctgtgtgtc ctcgcacatc attcctgtgc 31440 

cagggttcac gactcagaaa agaccactgg 31500 

gcacgcacat gctgatgtgc cccctccgct 31560 

tccctcgcca gacacatgca tcactaactt 31620 

agaagttagg agtctgccat catcagctat 31680 

ggacatgaag ggcaacaacg agaagactcc 31740 

accaccagca ggaaccaacc tggctctccc 31800 

gggaatttta gggacctctg tgttctcatc 31860 

gagacccctt agaacaacaa agaaatcaat 31920 

gcccctccca tcacaggcaa cagcctccct 31980 

agaggtgctc agagaggcac ctctggtccc 32040 

gatggtcacc ctgggcatcc tctaccagga 32100 

cctgaagttc ctcccggggg cgggggtggt 32160 

ttaccacgtg gttagactga ggactgaatc 32220 

acacagaggt ccaatgctgg ccaagagcta 32280 

gaatgttcca ccccaagcca caggaataac 32340 

agggtgtgca gcaggggctg cacaggacac 32400 

ggggaggcag tcattggcta gccaggggca 32460 

gcagtgtcca agccagattt ctaggctcca 32520 

gattctcccc ctctgagtca aggttggcct 32580 

aactgggatg ttgtggtctc acatccaagc 32640 

tttcagatta aacccagtcg gagacaaaca 32700 

gccttcaagc cacatggcag tgaatatgcg 32760 

actggctcaa ggacagtttc aaggggttca 32820 

cccctggaca gtttggggtt tttccctcct 32880 

accccataat tgtacctccc aacaccccca 32940 

tggtccccac ccatccccct cagccctgga 33000 
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ggctgaactg atgggcagct aaggtccaga cagtggctgg ctcttggaaa gcctgtctct 33060 

ttcctttgac tcagaccact ccctgccgtg gcttacatca ggaggtgcaa gggctgcagg 33120 

agggcagcca gaccccacaa accagctagg ctaaatggtg cttattgttc gcaagaggcc 33180 

atgacctcat ttgtctccca gctcttttgg taagagagaa tgagaggaag ctggacagag 33240 

aacctagcag gcctcaggca gcccactgct ccttgctgta agggaaccag caccgatggt 33300 

tctgaaaagc agcgatccga atggagtcag gctgagctgc aggaagctca ccttccttgc 33360 

tcactgctgg tggaagcaac ttcaggaaga gcccagccta tgggactata gctcctccgg 33420 

ggtactgctg agtccagccc cagagcttag ctccctgctt cccaccaccc accaccacat 33480 

cctttcccaa caccattcaa aaccccagtc cagcctctcc tactggtcta cagtgagcgg 33540 

ctaatagagt cctgggcctc tgtcccccca attctctctc ccctctcatc tgttcacctt 33600 

ggttcctaaa ctgcaggggc tactataacc ctacctccac ttccttgcac ccctcttttc 33660 

tgctctctgg ggtgcccctg ccactcccag tccctctagc cagggagcct cttccatatc 33720 

tgtcttcccc aggctagacc aggcgctgcc ttacctgtgg ttgcggcagc ttctctcaca 33780 

gcctgcactc tgaggggctc caggaagcag tgaggggagt agctgcctct caaccagcgt 33840 

ccagcaggct tcagattaca gctactcttt tcttaaagtg acctgactcc atttggaatc 33900 

tgtgattgca tcattgtctg gtgttaactt taacccactg ctgcccttcc gccatgtggc 33960 

tccaagacca cacgttggcc accctcctct cccaccacat ctcccttgga tctttatctc 34020 

tcttcattgg gaccttcatt gggacatgat ggctaacttc aggggcactt gggccagcct 34080 

ggggtaggtc atgagtctga acttgaacat ctgaaaggat tggctgagag gcaggctgca 34140 

tggagagact gtgagccagc cggtatggag atgctgggtt cttccaggcg cttggctctg 34200 

gctcactgca ggtgggagca aggtgattct tctcccctcc tcacctggaa aatgaaggaa 34260 

tgggactgta cctgacagct ctgaaggttc caaaggacag tggggtgggg actagagagt 34320 

tggcccagtg cttatgagca ctggctgctc tcgcagagga cctgagttct gttcccagct 34380 

cacatagcaa ggactcgaaa ctgcttggaa ctccagctcc agagaatctg acgctatctg 34440 

ctnnnnnnnn nnnnnnnnnn nimnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 34500 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnatgtcagg catggtggtg 34560 

catgccttta atcctagaac tcaggaggca aagcagatgg- acatctgaat ttgaggcccg 34620 

cattgtctac atagcaagtt ctaggctagc caggtacatc ataagaacct ttctcaaaaa 346B0 

ataaataggg ccagttggca aaatttagct tgccctccta' acacaagaac ccaaagtcaa 34740 

ccccagcaac catgtaaaaa gaagcaaggt gtggtggcac ttgcttgtga tccagcattg 34800 

tcaaggtgga gacagacgga tccatggggc tcactggcca gccagctagc tggtctactt 34860 

agtatgctcc cagccagtga gagactgaaa aataaataaa taaataaggg gttaggaaga 34920 

ggtaacatgg tggctcagtg agaaaagata cttgtcatac aagcctagca accctcaatt 34980 

ttcagtggcc actaaaggtg gaaggagaga accaactcca aagaattgtc tcctgacagt 35040 

tttatgctgt ggtacacaca cacgcacaca cacacttgtt acatgcatat gcatacaatt 35100 

aataatttaa aatgttatgt gtatgggtgt tttgcctaca tgcatatctt tctgtgcacc 35160 

acatgtgtgc aatgcctgtg aaggctagaa gaggacatca gatcccctcg gagttacaca .35220 

gggttgttag ctaccatgtg gattctggga acaaaaccat gggttttcca aaagaggctc 35280 

ttaatcactg agccatctct ccatcccctc aatagtatat atttctgggg ctggagagat 35340 

ggctcagagg ttaagagtat tgactgctct tccagaggtt ctgagttcaa ttcccagcaa 35400 

ccacatggtg gctcacaacc atctgtaatg gaatctgatg ccctcttctg ctgtgtctga 35460 

agacagctgc agtgtactca tatacataaa gtaaataaat aaaccttttt tttttttgtt 35520 

ttgttttttt tgtttttcga gacagggaga cagggtttct ctgtatagcc ctggctgtcc 35580 

tgaaactcgc tctgtagacc aggctggcct tgaactcaga aatccgcctg cctctgcctc 35640 

ccaagtgctg ggattcaggg tttgcgccac cgccaccacc tccaaggctg ctgctgcggc 35700 

caccaccacc ccaccaccac actacctgac tatttaactt ttaaaggcag ccatctcatg 35760 

gaaaatgaca cctagcattg tcctctggtc cctacatgac cccatgtgca aacacatacc 35820 

tgcataaaca cacataaata cataagtaaa cttagtctgg ttgttttgga aatgtgctat 35880 

ggtttggatt gtgtcccccc aagggaaaaa ttgggtccca gagtagtact attggaagag 35940 

agtagaactg ttagggttta ggcctggtgg gaagtggcca tggctagaga gacatgccaa 36000 

ccaaggggaa tctctggctt catcttttcc ctttgctttc aggtcctaag acagtcacaa 36060 

ggctgcttca ccacatgccc agaagcaagg ggagcagtca tggctgggac cgctaatgca 36120 

gctgttgatt tccccagata tttgtagtag taagagacag gtgaggaacc ccacagcaag 36180 

tgttagtaat tgtgtgtgga ggtgccctcc ggggacgggg gccctcctgg ggcaggacgt 36240 

tcctcttcct catccacctg cactccgaga acaggaaatg gtgactttgg cagagcttaa 36300 

gcagagcccg ttcatgttac aagtatgtaa attcataagg accagtttct ctccatatga 36360 

aacagcttca aacaggagaa ggaagaagca aacattaagg aaaagctctt ttattgcaga 36420 

ggctacactg aagctaccgg ccgccttcct ggaatgtata atcagcttcc ctctgggggt 36480 

tctgtagagc actgagacat taagtactac tggggtccag gattctgcct atgaagagga 36540 

gggcccccgt gtccgtgtcc ctcagaacaa agaggaaagg ttggttaagg tgatagtcta 36600 

gcgggaaggt gaggcggacg ggctggaggc ctgggctggg gctgcttcct gccccctctt 36660 

cattccactc gaaagcagcc ctgtgttcca cttgggtgag cttcacgggt ttgccagtaa 36720 

tcttgctgaa gtcgggtgat tcaaacaacg actgtagctc tgtggagatt cagagattcc 36780 

attaacacca cacacacaca cacacacaca cacacacaca cactccctgt ttgtgtaggc 36840 
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tgattttcaa gaaagcaagc tagaagtgga gtacctcaca gtgacttgtg agctatgagg 36900 

cactctgtga caggctcagt gacctacctg agaacttata gccaagatgg ctgaagccag 36960 

acctggcctg agagaatgtt ttgggctgtt ataggacaca tagagataca cacacacaca 37020 

acacacacac acaccaagga ctgagtctaa tgggaggtgg ttcttcattc ccctcccctg 37080 

taatggtgtc acatgttccc tgagccaccc tacaaagaaa gccacaggac tcagttctgt 37140 

cagcaaggtg gcaggctcca agactcagcc ccgagcgcaa agtggccttg caaacatact 37200 

catgtcctgc agagacttgg taagttcgcc ttcgaagctc agcttcagct tggggacagt 37260 

cagcacagct tggatagtct tcagttctcg gtcgatgtca tgaatgaact cagaggtgag 37320 

gctctcttct atcatggtca agttctgggt cacggtcagg ggcaggaaga agatgatgct 37380 

catacttcct gtcaagggca gctgggcaat ctaacccaac agagatgcgc acaggttagt 37440 

tgtgagccag aaaaaacaaa acaaacaaac aaaaaaacac caacagctgc cttcccctct 37500 

gctgtaacgg ggccccagcc ttgtgctccc cagcctcagc ctgggctgta ggctactggt 37560 

tactggcagt ccttccatga gtagggagtt ttcttctcag cctaaaaccc acagaagttt 37620 

aatgaacaca cgtttgtttg tggttccgct acggtttcta ttgtgataaa acatgactga 37680 

aagcaacttg gagaggaaag ggtttatttc atctgacaat tcgcagggtg tcttctcatc 37740 

actaagggga ctcagggcag gaactgaagc ggaagccgtg gaggaacgct gctttctggc 37800 

ttgctccccg tggcttctta gcctgctttt ttatgctatc cagaaccact tgcccaggag 37860 

tgacactgcc cattgtgggc tgggcccccc cacatcaatc actaatctag aaaatgaccc 37920 

acgggtttgc ccagaggcca gtctggtggg ggcattttat caattgagtt tcacccttcc 37980 

aaatgactct aacttgtgtc aagttgacca cacgaatcag ggcctggttc ttaggagctg 38040 

aagtggaatg tcccccagag actgcctgcc agcactgctg accatttgct ttgtatagag 38100 

cattgaacca gaaatgaaca ataaaatgga tcctttgaac agatgtgttg atcctagggc 38160 

ctgtggacac agcgactggg cttcccagag cccccatgga atcannnnnn nnnnnnnnnn 38220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 38280 

nnnnnnnnnn nnnnnnnnnn nnnntggcct catcaggtat cagagagaga gagagagaga 38340 

gagagagaga gagagagaga ggataaaagg ttagcccagt ggtggtggca cataccttta 38400 

attccaacac ttgagaggca gaggcagggg gagctctgtg- ggccagtttg gtttacagag 38460 

taagtttcag aatagccagg gctacacaga gaaaccctgt cttgaagaga aacacacaca 38520 

cacacacaca cacacacaca cacacacaaa taagatcttt aagaagaaaa gaaaggatag 38580 

tggggaaaca tctgagcaga ggaagaaatg gggtgcgcag gacacccacc ctcagaggag 38640 

gccctcactg gaggtgtctg cacaggagaa cacttgcact cagcttgccc tagggcgtca 38700 

gaactcagaa ttcagtttca aagcactgac aggagcagtg actggggacc ccaggttgaa 38760 

tccccccttt atctaaaatg agtaagaacc aaaaaaacaa aagtgtttgg gatttggaat 38820 

ctgggttatt tgcatctaca aaagaggtct tggggaggaa acccagttct acccctggaa 38880 

ttcatcagtt tcctataccg ctgactacac aggggctgaa ggtaatctca atgttttcat 38940 

aactggtgtg gtgctacttg ctcatgatcc caacacttgg aaggtaggtc agaagttcaa 39000 

gagcagtctt gactactcag tgaatttgag gctagcctgg gctacatgaa aactcataaa 39060 

acaataaaag aaaagaaaag gtggcgagtg aggtggccca tcaggtaaaa gtgccaacca 39120 

cctcgcctga aagcctccac acggaagggg aaagccagct tctacatgtg gtcctctgcc 39180 

ctccgcatgc accatggctc gtgcaccccc acacccaccc acccacccac ccacatgaca 39240 

taaatacttg taatgattag tttctgaaga acaatatttt cgttgatctt gtttagggaa 39300 

caaagttcgt gcacattgac ctgtcgaccg tgtagtacgg gatccgctcc aggaagctaa 39360 

agattttggg atgcttcatg ttgcagctac tctgatgaac agtgttgctt tctgggccag 39420 

gtaatggtgg catatacctt tgatcccagc acttgggagg cagaagcatg tagatctctg 39480 

tgagttcgag atcagcctgg tctacagagt gagttccagg atatccaagg ctatacagaa 39540 

aaacccctgt ctctaaaaat cactaattta aaaaaaaatt cctttctaaa cctatataac 39600 

aaatgttttg taggctgcct taacaaagcc caatggccat tcagagaagg ctcaaaagag 39660 

aaagtttagg ggactctaca agcatcctca ggaaggccac agaaagcaga gcctgggcca 39720 

gtgagacttt gcagtgggca aggttcagct ctttatgtag gaagaagaga gtcaacagtc 39780 

agagtccagc tttccataaa acctgtgcag ggcctctagg caaagccctg tgttaggggc 39840 

aaaggcattt gcagtctaag cccggtgaca tgagctcaat ccttggaacc caggtggaag 39900 

gagtgtgctg actccacaaa tttgtcctct gatctataca tgtatgcacg tgcacgcaca 39960 

ctcacataca tgtccacatg cacacatgca tatacatgcg catgcgtgca cccacacaca 40020 

gggtcaaaag cagcaagaga tgccctgtga aaaacgtctc attcagtctc ccatcatcca 40080 

gtgccacact ctgagcacag gtggtactga tatcgttcct gattgatcga tcagttgatt 40140 

tgagaccccg cctcactatg tagcccaggc tggcctggaa ctcacagtga tcctcttgct 40200 

tctgcaagat gagcccatca tgcccaccat gttattgaag caataccatg ctctataaag 40260 

caaacctagg caggcaggat ggtggactcc tgtaatctca ggacttgaaa agtagaaggg 40320 

agatgaggag ttcacatcaa tctcccgtat gcgttggagg ctggagtggc tgttccctgg 40380 

gcgcttctgc cagcacctga ccaatgcaga tgcagatgct cacagccaac catcagactg 40440 

agctcgggac cccagtgagg gtgctggggg gaggactgga ggagctgaga cgggattgca 40500 

agcccatagg aagaacaatg tcagctggcc aaaccaccca gagctcccag ggactagacc 40560 

acgaaccgag gactgcacat gaagggatcc atggctccag atgcatatgc agcagaggac 40620 

agccttgtct gacagcatgg gaggggaggc cattggtcct gtggaggttt gatgccccag 40680 
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tgttggagga tgctggagcg gtggggcagg agtgggtggg taggtgggga gcaccttcat 40740 

agaggcaaaa gggatggggg agaaggcaga tgggatgggg gggttgtgga ggggtaagaa 40800 

agaaaaaaga tgtctctgaa agtaaaaagt acttgtcact aagcatgagg atatgagtca 40860 

acccccaagc cccacagggt ggaaggagag aaatgagtcc cacaagttat tttctgatct 40920 

atacgtgcaa tccatggcat acgcagaaac gcaaagacag acaatgagtt gggtgtggtg 40980 

gtgcacatgt aattccatca ttcaggagac agaagcagca gagttgttgg aaatctaagg 41040 

ccaacctaaa gacctacacc caaagaagga caaactataa ggaaaaaggt ggtcgaccaa 41100 

tgtaacatta aagttagaaa tctctcttca cactgtgtag atactgtaca aggaagagaa 41160 

aaggcagcca catcaaaaca gtgtaaatca acgagaaaac cagaaacaac tcaagagaag 41220 

gctgcagggg cctgaattct gttctcagaa cctgcatcaa gccaagagaa tcaaaactgt 41280 

ctgtaactcc agctccctgg gatccaacac ccatttctgg cctccatcag catcactcac 41340 

aggtgtgcac acatacacat caataaaaat caaaaccagg gatgaagggg tagggaggtg 41400 

catgtggatc tgggaggagc tgagaggcac tgggtgaata caataaaaaa tttggtgcat 41460 

ggtggtgcac gcctttaatc ccagcacttg ggaggcagag gcaggcgaat ttctgagttc 41520 

aagaccagcc tggtctacag agttagttcc aggacagcca ggtctacaca gagaaaccct 41580 

gtctcaaaaa aacaaaacag ccgggcggtg gtggcacacg cctttaatcc cagcacttgg 41640 

gaggcagagg caggtggatt tctgagttcg aggccagcct ggtctacaaa gtgagttcca 41700 

ggacagccag ggctacacag agaaaccctg tcttgaaata aataagcatt tgttgctgtt 41760 

acagaaaact ccagcccagt ttccagcaca cacagggtga ctcacaacat cataactcca 41820 

cttccagggg atccaatgcc ttcttctgac ctctgtgggc accaggattg catacagtgc 41880 

acagacatgc acataggcaa aacactcaca aaataaaata aatctagcaa aaaaaatttt 41940 

aactaataat ttaaagaaaa aaataaggaa gccgggggtg gtgtcgcacg cctttaatcc 42000 

tagcacttgg gagacagagg caggcggatt tctgagttcg aggccagcct ggtctacaaa 42060 

agtgagttcc aggacagcca gggctacaca gagaaaccct gtcttgaaat aaataaataa 42120 

ataaaaaata aggccaagta attcttggaa gaatcccaag gggacactaa gtgtatataa 42180 

aggcgttcca tagggctagg aatgaggctt agcgagagca acttcgctgg tgtatgaaag 42240 

tccctcagct gcatgtggta cctttaatct aggctctccc* gaagcagagg cagaaggatt 42300 

tctgtgagtt caaggccagc ctggtgtaca tagctagttc caggacagaa agggcgatat 42360 

aatagaaaca tcntacctag agcccngcca aanaaagggg agacctgaga ccagagagat 42420 

gactcagtgg ctaagagcat tgactgctct tccagaagtc ntgagttcaa ttcccagcta 42480 

aaaatttatt taaatgttta ttacttgtat tattatttaa atttaaataa ataagtaaat 42540 

gggagcctag gtttgagtcc ccaaatcacc aagaaaaaat gttatcattg ctaataatca 42600 

aattaagagc ataagaactt ctttttaaag aattcttatt tattttatgt atgtaagaac 42660 

actgtagctg tcttcagaca caccagaaga gggcattgga tcccattaca gatggttgtg 42720 

agccaccatg tagttgctgg gaattgacct caggacctct agaagagcag tctgtgctct 42780 

taagtactga gccatctcta cagctcttat caggttgata aaatttaatc tcgtggagcg 42840 

ctgagaccaa gaactaaagc tgggagattg aaaaatgcag accaccaagg ccctgctcat 42900 

ttctccagtt ctgatcagct cccgtaccag gggtctaacc aggcctgtgt ctgcttccct 42960 

gagtagacca gaggccccat ctaaacagcc tgcctgcagc agctcctctc tctaggtgga 43020 

cagatgggaa tttcagacca atgtcatttc ccaggacatc aacacagcag ccaaatttat 43080 

tggtgctgtg gctgccacag ttggtgtggc aggatcaggg gctggcattg gcacagtgct 43140 

tgattattgg ctatgccagg aaccagtctc tcaagcagca gctcttctcc tatgccatgc 43200 

tggggtttgc cctgtctgag gccatgggac tcttctgttt gatggtcgcc ttcctcatcc 43260 

tcttcgccat gtgaggctcc ctggggtcac ccagccgtcc ctgctgcctt gactccatgc 43320 

cagtcctggt gctggagtct actgagattt accattaaac agcaacgttt ctctaaaata 43380 

ctattaatta attaattaat cacgtgacaa ccccagcgtc catatgggtg tggaaaatga 43440 

ggaactctac ccatcataca tggcgactat gaagaacaat gtgacagaaa atgctaacat 43500 

catgtgtgac cgcatgcatc agccctgact gctaaaagtg gacaagcccg aagcgaaagc 43560 

ccaatgttct acttctaaat gcatgcacca aacgcctccc acaggaccag aggtgcagct 43620 

ctgatagggt ccttgcctgg catgcatgaa gctgtggaca cgaggcatta ttcgcaagaa 43680 

cattctagct gtctggaggg ccctcaatcc actgtgttcg cgctgttcca gcaccagtgc 43740 

ctcctggggc tgcacctgaa aaaggggact gcttaagagg gctcctacca agcctactgc 43800 

cacagatgca tgatgggaaa gccttctgga agcaactggc tgccaaaggc tctggacaag 43860 

agatcaccct ctactggaaa ggtggtttca gtctaggttc tgtgggattc caggaaatta 43920 

gacaacactg gcagtccaac agacagacga tctaaacttc caaggcacag ctggtagaac 43980 

ttgctgcgga accagacaac aaggtacgag ctactcccat acaacataca aaaaagcaga 44040 

gagagagtca gagacagaga cacacagaga gagacagaga gagagtctaa agagagtcag 44100 

ggtctcagga ctgagggtat agtctactgt agagcatttc cccagcatat acaagaccct 44160 

ggattcaatc tgaacacagg aaaaaagggg gggggggact tcgattatct caaattctcc 44220 

tttttgtgac acacccctaa agtcactgcc tacttccctc accgccatga agtaaagagc 44280 

tgtttgcgct tatgtctaca cagtctcggc tcccacttcc tcctcccctc tgcttctgtg 44340 

ctcatctcct ctgaaaccac tgcagcaagt gacttgtgtt gactgccaca cggaaactct 44400 

cctcagtagc aggcagcaga gcagagctct gtcttctcgg agcttcttct ctcttgtcgc 44460 

cattttctcc cacccttaag taccctatct tctctgtctc tgcttgttga tccttggacc 44520 
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cttttccttt ctatgaacaa aatatctcct taaaggatct cttctagttc agggtccccc 44580 

cgcccccact gtggagaaaa cccagggcct tgcacatgct cagcaggagc tccatccagt 44640 

ctctagctcc atgacttaaa gcatctctgt gctgtcaaat atacacttcc agcccttacc 44700 

aaaatattca gtcaactcct tgccattcaa aatggatgac ctcaaagcca gagtcagcgg 44760 

tgctatgact cccagatcca tccacttggt agcccaggaa tgaactcann nnnnnnnnnn 44820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 44880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnngg ttactgggat ttgaactcag gacctttgga 44940 

agagcagcca gtgttcttaa ctgctgagcc atctctccag ccaccaccac caccaccacc 45000 

accaccacca ccaccaccac caccccacca ccaccccgcc ccacctgctc attcctgatt 45060 

ggttggttag tttagtctgt gagacaggag ctgtcccttt tctatagtgg aaggtgaata 45120 

agaaactcct gaaagtgaag gcctacaaaa cagccacact tatttgttgg aaaatactgt 45180 

aaatgtgaca tgtaaataca tgctaaaata attcgttaag tcagtgaaca accttaaaac 45240 

acagtctgta gcctgaatta cagacacgac acgagccatg acagaggctg aaataagacg 45300 

cctttgcaag gagaagggca gaagcttcca tccttgctag caaatctttg ttccaagctt 45360 

tatcagattt tattgctttc tctttctgtt tttctttatc atatttgttt atttgttggg 45420 

ggaaagccta atcttcatag cccatgtatg tgagcacttc agcatatgtg tgtgaacacc 45480 

aacagcacac gtgtatgagc accacagcat aggtgtgtga gtaccacagc acatgtgtgt 4554 0 

gagtacgata gcacgtgtat agagttcaga ggagaactga gagagtccgt cttttcctcc 45600 

tactgtgtag gtctcagggg tggaacttgg gctcagcctt ggtggcaagc tcctttatcc 45660 

acagagtcat cctgccagcc cagctttctc tttttctctc tgttatgtct atccactctg 45720 

ttcaaggcta actcactgac tctgagttat cagaactgct tgtgagagca ggagtaactt 45780 

tggacatctg tgctggtagg aacaccatcc ccactcggct tggatgacga aggggaaaaa 45840 

aagcatcacc aaggagttcc accacctcaa ccagcaaata tttacctcct atacatggat 45900 

aggtggggtg ggtgagcctt gtgatttatc gttaggatct catgggagtg attacagctg 45960 

gtctactcca tgaccaaaat ggtgacggtg gctgaccaaa aagaaacagc tacacctggc 46020 

tctagttttc tttctttctt ttttcttttt cttttacccc acggtactaa ggattgaacc 46080 

caggaatgca agagctctgc caagtgagct acattcccag- atctgttttt ccatttcttt 46140 

ctttcctttt agattttatt tcgatttatt tgtctatgtt tgtgtgtatg tgtgtatgct 46200 

tatgtgtacg tatcagtatg ccatgggtat acagaaacct' gagaaggcca gaagagagtg 46260 

tctgggttac aggagtttcg agctgtcctg tgggttctca aatgcagcag caccaccacc 46320 

aatcaccccc acccccaccc agccttcgag ttcaattctc atcatcacaa aaacacacac 46380 

acagacaagg gcctgcaaga tggctcagca ggtaacgaag ctcgtgtcat aagcctgaga 46440 

acctgagttc actgtctgga acccgcgtaa agggggaagg gaagaatcaa ctctatgatg 46500 

ttgtcctctg ctctccccat gtgtgccatg gaatgaacag ccctcccaaa cacacatcaa 46560 

gaataaataa aactaaaatt agcttagtaa cttttatgtt gaaagtggtt tttacatgcg 46620 

tgggcaacaa taacaccgag agtagaaagg caagcatgta tgtcactgaa cagcattgaa 46680 

gaaaaaacaa acacatttcc tgtacatcgt tctgggagtc tgagttaggg tttctatgct 46740 

gggataaaaa caccctgaca aaaattaacc tggggaggaa gctgtttatt tcagctttta 46800 

tgtctacaac atgacctgtc acccagggaa gtcagggcag aaattcaaac aggtcagaag 46860 

cctggaggta gaagctgata cagaagccat ggagctgctt ctggcttgct ctagcacaca 46920 

ggagtactag cctaggggct gtactgccca cagtgggcta ggctctccca cagcaatcat 46980 

aaatttagaa aatgcactac aggtttgcac acaggccaat ctggtagggc cattttctca 47040 

attgaggttc cttcttccaa aaggacttta gcttgcatta tgttgacata aaaactagcc 47100 

agcatattgg gattatagat attctcataa aaaaaagaca tttagattcc cacataacac 47160 

catattcaga aattaactca atgtgaacca gaagctctga aagtaagagt taaaactatg 47220 

aaaaattctt acaaccatcc ataacaaaaa tctgatgccc tcttctggag tgtctgaaga 47280 

cagctacagt gtacacacat ataaataaat aaataaatat ttaaaaaaat atatgaaaaa 47340 

tcaggctggt gagatggctc agtgggtaag agcacccgac tgctctttcg aaagtccaga 47400 

gttcaaatcc cagcaaccac atggtggctc agaaccatcc gtaataagat ctgactccct 47460 

cttctggagt gtctgaagac agctacagtg tacttacata taataaataa ataaatctta 47520 

aaaaaaaaaa aaactatgaa gaactatgaa ctacaagaag tcaggaatag ggctgggggt 47580 

gtaacccaac agaaaaacac ttgcctggcc tgcgtttggt ctctagcacc accaacgtag 47640 

aaagagaaca gcagaggatg agggcatcct gacttgagtc aagtgacaag tgataatcct 47700 

cgagacacca aaatcacaat gataaaagag atcaacaagt tgggctttat ctgaataaag 47760 

agctgtgtcg ttaaatacca cgcaggaagt gaagaggagc tgagtctggt aacacaggcc 47820 

tgaaatccaa gctactgggt ggactgaggg aggacaacag ctagctcaag gcccacctgg 47880 

acgccagagt taactcagag agcagcttgg gtagctttaa tgagactctg cccaggccag 47940 

tgcacaggag agatggctca gtggttaaga gcattactcc tcttgcaaag gacctgagtt 48000 

caattcccag cacccacgtg ggcacttaca atcatccata actttagttt caggggatcc 48060 

aatgcccttt tcacagtacc aggcatgtac acagtgcaat tacatacata catgcatgca 48120 

tgcatacata cacaggcaaa acttacataa aatactaagc agataaatct taaaagaagc 48180 

cgggcgtggt ggcgcatgct tttaatccca gcacttggga ggcagaggca ggtgtatttc 48240 

tgagttcgag gtcagcatgg tctacagagt gagttccagg acagccagga ctacacagag 48300 

aaaccctgtc ttgaagaaaa taaaaaaaaa aaagaaaaaa atcttaaaag aaaaggagag 48360 
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gactggagag atggctccac agttaagaac acttgttctg aggtctacag agtgagttcc 48420 

aggacagcca ggactataca gagaaaccct gtttcgaaaa accaaaacca aaacaacaac 48480 

aacaacaaca acaaaaccac ttgttcttac agaggacttt ggtttgattc tcagaatcca 48540 

catgatggtt cacaaccatc agttgcaggg atccaaggtc ctgtcttctg tgggcaccag 48600 

gcatatatgt ggtgtacata catgtataca ctcatataca taaaataaaa agttttaaaa 48660 

aggaggctgg gtttgtagcg cagaggtaga ggtaaaaaga ctctagcttg tttaatgttg 48720 

acatgaaaaa aaaaagacat ttagattcct gcatcacacc atatccaaaa attaactcaa 48780 

tgtgaatcat aagctctgaa agtaagaata agcctagtat gcactgtaag gctctgggtt 48840 

cactccccag cactgcaaaa gatcatgaaa ccagaaatgc agatcctctg aaccacagca 48900 

tgggaatgta actcagccga tgcagtgctc acctgtcgta tacagagcac aggataaatt 48960 

gattgtggtg gtgcatacct ataagctcac tacgtggaaa gtagaggcag gacgaccaaa 49020 

ggttcagtga catccttggt cacatagaga atttgaggcc agtctggtct gctggtctat 49080 

ttggaatgct gtctcaataa ataaaagaaa gaaagaaaaa gaaaagaaga agtcctatga 49140 

ttgtcttaac ctctgacctc tgtgttcatc aagtctcctc ctcaggaact cactggtcat 49200 

cttgtgaaaa cctaccccag agtctctgtt cagaggaccc aggctccagc tgtggttacc 49260 

acataggatt tttatactag aaaaataaaa tgaataagta tgtatttttt aaaaaggtgc 49320 

agagctggat atggtggtgt ctagttatag catccagaac tgagacagga tagccatgag 49380 

gttgagaaca gctagactat acggtctcaa caaacaaaag taagggatct gagtagatga 49440 

ggttttaatt tttttctttg tgtttgttac ctaacgtgta tggttgtttt gaatacatgc 49500 

atgtctgtgt atcacttgtg tgcctgaaac ccaaggaagc cagaggaggg catcgggtcc 49560 

cccggaagta ttattacaga aggttgtgag cagccatgtg ggtgctggga atcaaatctg 49620 

aaagagccac ctcgggctgg agagatggct cagtggttaa gagcactcaa tggctgctct 49680 

tccagaggtt tggagatcaa atcccagcaa ctacatggtg gctcacaacc atatgtaatg 49740 

ggatccgatg ccctcttctg gtgtgtctga agacagctaa agtgtactca aataaataga 49800 

tcaaaaaaga aaaaagaaac agccacctct ccactctccc tttttaaaat cctcttgcct 49860 

ctgtccctta atgttaataa cacaggtata tgatactatg ccttgtttat gaatagaaaa 49920 

tacacgtgct aaagcaagtg tgaaccttaa atacattatg ctgagtaaaa ggagtgagtt 49980 

gcacacaaga cttttctgct caagagtatc tgtatgaagt attgaacatg tgaactctga 50040 

aatcgggagc tgaggaagat atggggagtt ctaatggcta' caacatttct ttttggaatg 50100 

atgaggatgt tctagaactc aaaaatggtg ataactcagc atatatacta aaactcattg 50160 

aattgtacac tttaaatgaa tgcaataaaa cttgtctcag taatgtggtt tagaagatgt 50220 

acagacatgt gtgtgtgtgt gttaaaacat ttcttggcat ggcaataaaa atacagtttt 50280 

agccaggtgg ttgtggctca aaaaataatg ataataacaa taataaaaat aatgaaaaca 50340 

gaggctggag agatggctca gcggttaaga acactgactg ctcttccaga ggtcctgagt 50400 

tcagttccca gtaaccacat ggtggttcac agacatctgt aatgggatct gatgccctct 50460 

tctgatgtgt gtctggaaac agctacagtg aaagtcattg caaggacttt acaatagtga 50520 

ccatgataac attgaagcta gacttgctac tactgctgag tgtgtctgct ggctctttct 50580 

aaggagtaat gttagctttt tgtcctaaat ttgtttcctt cctttcctct ctccctctgc 50640 

tgttttttct tacccctctt ttactttgct ttcccctctc atctcctctc ttaacagagt 50700 

tgtcctatgc agcccaaatg ccatcttcct gcctcagcct ccccagtgtt gaaaaatact 50760 

ctttccacag gttatgttag gagactggag tctgctcagt cggggaggga gcctgggtca 50820 

agttctgagc tcaattcctt ttctttcttt ctttctctct ttctttcttt ctttctctct 50880 

ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt taagacaggg 50940 

tttctttgta taccctggct gtcctggaac tcactttgta gctggcctgg aattcagaaa 51000 

tctgcctgcc tctgcctccc aagtgctggg attaaaggtg tgcaccacca ctgcccagcc 51060 

ctgggctcaa ttcttaacat tgtggagaga aaagtattgt agctgttctg gccacctgga 51120 

attactttgt ttctgatctt ttgctgcagt caaatccttc tcatccatct ttcctcgtca 51180 

ggctataata tagactctcc ttgcaatact tggaaatgct ctacagtcag ctacatcctc 51240 

agtcctgctc ctatattttt tcctaagctt ccttctaagg tctttattgg tttatgattt 51300 

acacagaaca tttttttttc ttgtctatag catgcgttag agtgatcgtt gccagataga 51360 

ggaaagagaa atgagagaan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 51420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt 51480 

cagctactga ttcctcctcc tccctcctcc ttcctccctc ctccccagcc tcatgctctg 51540 

ctcatcttgg acttctgcgc atgtcctcag cccagacctt ctgctcttgc ttctcctctc 51600 

cccagcagcc ccccagttct cttcctgaaa cttctgaggt actctccatc acctcctttg 51660 

gctcctgctc tgattggtgt cacctgctgg ataggcttgc tcctgactcc actgttcgtg 51720 

tctcaattag ggaccctcac cctctgatat accacacatt tccctagtgt ctccacctcc 51780 

cacccccacc ctatacgcac atacacactt agctgcatca ggatcctaca ccagggactt 51840 

cttacccttc taatcctccc caccggacac tgcccaggga cactggggct ccagagggct 51900 

attgccacac ggacacacag gagatctcat caaggagatg tgcctacccc agagggtagc 51960 

tctcaccatt cacaagcaca ccacttctgc ctccagcttc tactctctcg caggaagtag 52020 

ccagcccggt gccaagtatc cccaactaca tccccaaaat tctcagacac tgccagcctc 52080 

cagctgtcag cctggccccg gctggcgggc gcctgctcct ggcatagcga ctagggtgta 52140 

attagaaacc cgctagctcc ctaattgcca gttctgagct gtccttgtta ccggctgccc 52200 
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gaggcacaca tagaggaaaa ggctgagagc 
tagacctaga gaggactggc atgtggccag 
gcaatctgcc ccgtgggtgc ctcccagcca 
ccagcccggc ccattcattc tttattcagg 
aaagagtctt gggagctaaa gggtcccttc 
ggaggacaag agagagaaga aggaggaaga 
agaggggtga cttgggtgac aggacacagg 
actcttgtag ggtaggtcag cctccaacca 
tgggagacca acacttgtct ctccttttcc 
tggcctctcc agcctctgct tacaaatgtg 
ctacctctag ccaaaactgt gtccacaagt 
ccttgggaag aaatgaagat tccccatgcc 
gcaagagaac agcttctaca ctggtgatgg 
tggttagaaa gagtcacagg agctgagagg 
gcccagggcc cctcctcact tgggggagct 
tgcaagtttc aagtcaggac caaaggtctt 
gcctatgagc ctgttgtcag gggaatctcg 
gttagggttc ctcacactgc ccacaaagag 
gaagaggaag ggtcggttca cagtgaagga 
ggtagctgcg gctgcctcca caccagcctc 
gctagacacc accagattct gctcagagat 
ctgcaggcct gcccagaaca gcagatgact 
tgtctagcat cctggctaaa gggaacactt 
cctcagtcta gacaacactt ctagaaacct 
ggatctgcgc taggtacagg acagaaagtc 
gaagcagggc cacctctggc tcaggcacac 
taggggcacc ctatccccaa gctccttacc 
ctgctgttgc agatggagtt taggcagcca 
atggtacaga gtatcccagg tcaggttggc 
agtgggcatc acgaccacaa agctcatgtt 
ataagaaaag gagagaacat gaggaccaga 
aaaattactt ctgtactttt gagacaagag 
taactggtga agtggaataa taccacttac 
gataatgctt ttacacctct ctgtgtgcac 
gtgtgtttgc tcatgggtat atggagtcag 
ttgctctcca cttttttttt tttttttttg 
aagacaggct ttctctgtgt agccctggct 
gcctcgaact tgcagagacc cacctgcctc 
tgtaactaca catagctccc tcttttttgg 
ttgaaatgac tgtttggggc tggagagatg 
ccaaaggtcc tgagttcaaa tcccagcaac 
gatctgactc cctcttctgg agtgtctgaa 
taaattaatc ttttttaaaa agagaaagaa 
ctgccccaag tgctgggatt acagagctgt 
catgctaatg tatcccaggc tgtcctcaga 
ttcttttaag gtttattttt atcttatgtg 
gtaccgtgtc cttgcagtac cctcacagtc 
gttgtgagct gcatggtggg tgctgggaat 
taagtactct taactgctga gccacttctc 
tctaaaagta ctagggatgg cggatgtgcg 
gttgaacaaa gggctttgtg catgccaggc 
acagactttg actcttctag tagtagtgtc 
tcagcaagct gtacaaacaa gctatgggcc 
aggtggggct actggctttc aaaatgaccc 
ttgcccctcg tctttcaccc tctcccagag 
ctctctctct ttaaacagga tctggttatg 
ctgcttcaac cttctgagtg ctgggattat 
cctttttttt tttttttttt tttaaagaat 
aggctcaggc ctccctgggt accagcataa 
ggatggaaag atggagccag aggatctttc 
cactcatgtc cactgggaga tactgtagta 
ctaagtggag ctgagcaggc tcgtatcctc 
cccctccaca tggctcctcc ctgcagaact 
actaaccaac agaatggatg agcatgtgtg 



tgagccaggc tggcatggag gtagccctag 52260 

ggaccaaacg tggcacagag agggctcagt 52320 

catccatttg cccagaactg tgacgtcaaa 52380 

tggcataaaa atcactacaa aaactttaca 52440 

cttgcctcag tccccaagat tcctggcagg 52500 

ctcctggcag tgttggcatc tccaaatacc 52560 

ttggggacct gaatgtcttc agcaagggac 52620 

tgaagtataa caccaaggcc agtctaagct 52680 

cacccagggt gtctggaata tgtctaaaga 52740 

gagggaccct aagttaggga cttgcctaac 52800 

gccagcccac aaaagatcac cccctgagcc 52860 

tgccttcctc caggccccac cccacctgct 52920 

tccttccggt cccaccctat cccacaaagc 52980 

ctgatccagg tggggactca ggatgctgct 53040 

gaactggggg tagtcttcct ccatgcgggg 53100 

gcctccatgg aagtcagctt tgtcattctg 53160 

ctgttcctgg agctggggca gcgcgctggg 53220 

gggcacgcct atggtgtcct ccatgatgaa 53280 

ggagagggac attcgattca tggctacgct 53340 

gctgagctcc atggtagact gatgttgcac 53400 

cccacgaagg tctgggccct ggaacaattc 53460 

ggtcagtgct gccccaaggc tatgtggatc 53520 

gaacccagcg gttgattgga atctgttaga 53580 

tttttttttt tttttttttt ttttaaatca 53640 

tagaggagca tatcaaatgc tcccatccag 53700 

tggcagctcc cgtactctgc ccagaccacc 53760 

cagttggctg- agggtggcca ccaggtccag 53820 

caccttggtg ggcctctcct gcagcgaggg 53880 

tagtacctcg gacacgttcc actcaaaata 53940 

gttcttaaag gggaaatgag ccacctacag 54000 

cagcacctgg acctgtctgg agtctgggcc 54060 

ccagaaattc agggttagca tgctttcact 54120 

ccctttgcaa ggtgacatgg gaccaaatga 54180 

acataagcat atatgtttgt atcggtgtga 54240 

aagtaggtaa acatcagtcg tcttcctaca 54300 

gtgttgccat ctttttgttg ttgttatttc 54360 

gtcctggaac tcactctgta aatcaggctg 54420 

tgcctcctga gtgctgggat ctaagatgtg 54480 

acacagggtc tcatggatcc caagctggct 54540 

gctcagcggc taagaacact gactgctctt 54600 

cacatggtgg ctcacaacca tccgtaacaa 54660 

gacagctaga gtgtacttac atgtaataaa 54720 

atgatggcta catacttctc tctcgtctct 54780 

acaacaagcc caagtttgtt gtgttttaga 54840 

ctctctatgt aattcagaac gaccttgaac 54900 

tatgggtatt ttgcctgagc atttgtctgt 54960 

cagaggaggg caccatttcc ccctgaactg 55020 

caaaccctgg tcctctgcaa gagaagccag 55080 

caccttgagc ttttcttcct cctatctcga 55140 

ttcatgtgcc tggtttatgt gttgctaagg 55200 

aagcactcaa caactgagct acacatcccg 55260 

tccactacag cctgagttct ctatctgctg 55320 

ttcctgtcct tgcctctcag ttctctccgc 55380 

atagaggagc cacagcaaac agtaggaagc 55440 

agtcagctat aattcgagtt tttttttcct 55500 

tggccctaac tatcttcaac ttcagtcttc 55560 

ggtgtaagcc accacactca gctcacacaa 55620 

ccatgcagtt aggacagcat ggaaatgacc 55680 

tgcctgcagg cgggtcctct gccagtgggg 55740 

ctctctgaac ctcaatgtcc cacagtgaga 55800 

ttcaaggaag aagcaacagg aaggtgagag 55860 

tcaccacggg ctacagagaa gtctggctgc 55920 

ggcaatgctg ggcccggctt gcccagtcaa 55980 

gtgccacaca cctgggaccc cagcactcag 56040 
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acagctgggg cagaagggtc atgagtccaa 
gaacaaacaa aactagcccg tcctgttatc 
tactctagcc aaggagtcac ggttaggcta 
tggcacggag cttcatccaa tcagaatgtt 
atctatgaag cactgtgtgg gaactcctct 
tgggctccgc taaaacccca gcacagagaa 
atattttctt tgaggagatg aacctcaaag 
aaacccttca ggagctacct acccagggac 
ttgccagcca aaggcaaagt ccagactgac 
tcgaagagga tatgacaccg cgtgcatcat 
gtggaagaaa tctttctggg tgaggctcgg 
caggcagagg gcaggagcca tgtaacataa 
ctatnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
ggaaaaagga atcttaaatt agaagatgcc 
ctaatgattg atgtggaaag tcacgaggga 
atataaaaac acaggctgaa caaaccacaa 
tttctgtctc aggttcctac cttgacttcc 
aagaggaaat aaaccctttc ctccccacat 
acagaaacta taactaatat agttggtttt 
acagggtttc tctgtatggc cctggctgtc 
tcgaactcag aaatccacct gcctctgcct 
gcgccaccat tgcctggctg gttttctttt 
tttaaaaaaa aaaatggcca tggcatataa 
tgcccttgcc ataaattatg taaaaatttc 
tcagcactca tgagcctgag gcagaggcag 
tacataacaa gatcctgtcc aaataataac 
aggggccact gcaaagacag tatgacaaaa 
aactgtcctc ctctctgtaa ttttcagttt 
tttgtaattt taattagttt tttaataata 
tatgaacatt ttcaaggcac ttgatacata 
caattacgtg ccaccagcaa tgttagcttc 
gtctattcgt agttctgatc ttgttatatc 
ctcccagttg agtgttctca ttgaatttca 
aaggattaca gtgttaacta ggatagactt 
atctttgcat aaattctttt taatcaaagt 
agggagtagg tctggccact gatgagtgtc 
tccctgtggg aagggcacat ctgtgttgta 
aggctctaag tggtcctgag ctagctaact 
ctattcactc ttctcatttt gcagagcact 
tgagtccttc tattgttttt cttttactta 
caatccaagg tacacttttg ctaaagaatc 
taactgcagc tgatatggta catacatatt 
taatgcgaaa tattgagtaa tttttaaaaa 
gagacattca actctgacac cagcacccta 
agaatctata aataactcac gaagctgaca 
gagaaaattt tagcaactgt tctaatagag 
gtcaactttt aattatgcag acgctgatta 
gctccttcct gctgcctacc tttaaccgtt 
gtaggctttg gaggggcgga caggcactca 
gaaagatgtt aatgcagttt gtagaattat 
tgaattgtat ttacatgccc tgtccctgtc 
cagagagaca acatagttct tatatttaat 
ggttcgtggt gtttcctttg ttcattacat 
ggttagttag gattcctggc acgcggacag 
ttttttacaa aaccgtttgc ctttaagagt 
ggaaatatta tttccctcct gcttaaaccg 
gtagccgatc ttacatgtat catacctatt 
tcgtaggtat gcagttagaa gcctctagtt 
gagtggggat ggccttttgt gtgccccaag 
ctcctctggt ctatcactat cctgccactg 
ggaggatctc tgcttctgaa cttctttggc 
attttgttta aagagcaggt aggctgttag 
cttgccagcg ccttgaccac tgttaacttt 



agcgaacttg tgtaacattg tcagaccctc 56100 

tcagccacag atgatgggcc caaggatcag 56160 

gaagcaaggg aagccttagc tgagacagct 56220 

cagagcaata agctttgaaa cccgacttcc 56280 

cttcccttac gagcagggcc ctggtcctct 56340 

cagttacctg gcacgtgaca aaaactcaat 56400 

aagctgtgtc ctggatagac acagcataat 56460 

cagactttac ctcccagtac caggcctcgt 56520 

ctgtatctca ggttgctcca gcaggaacca 56580 

gtccaccgac actgtgaacc gctcatccag 56640 

gtcaaacttg gtcctccaga aacctgcagc 56700 

aatcagcctn ctgcctgtct tgcctagaac 56760 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 56820 

nnnnnnnnnn nnnnaaccaa ggcaggtctt 56880 

ttgataagat tggcatgtag gtatgtctca 56940 

tggtgtcacc ctgggcagat ggcctggggt 57000 

agcagtagtc ctcaatggct tctgctttag 57060 

ctcagtgaag gcatgtcaca tgagagttgt 57120 

agtttttggt tatgatgtta tatgtcaaca 57180 

ctttttttgt ttgttttgtt ttgttttgag 57240 

ctggaactca ctttgtagac caggctggcc 57300 

ctgcctccca agtggtggga ttaaaggcat 57360 

ttttttaata catttataat gcattttaga 57420 

tataaaaaga agtgcttaca aatcaccatg 57480 

catatggaca tcagtctcaa gcttacaatc 57540 

gaggatggtg agctcaaggc cagcttagtc 57600 

aacagtaata atttcataca tagaactaga 57660 

ccactggccc tgcctaattg tattttaaat 57720 

ctaattttta cataactacc atgtattctt 57780 

gaaacaagct aagtgctaag aatattttca 57840 

cctcagattt gccctccagg tgagcagtac 57900 

cttttttccc taccatctga ttctgtttca 57960 

cctttttatt gtttccctgg gttccaacac 58020 

ttagcagctg tttcattaat ggcacagaag 58080 

tgacaaagaa ctatgagaac atatcttatt 58140 

tcctcaaaag cctctctctg ttcccatctc 58200 

caggccacag tacaggtgtg cgtggttctg 58260 

acaggattcc tgtcttaaca agccttgctc 58320 

gcccttggct ttcccttgat taccagataa 58380 

taccaggtag ctatgtcctg gaagtacgaa 58440 

aatcccattt gaaatgcgcc agggacactt 58500 

actcattttt atatgcaaaa tgtcacctat 58560 

ctctcttcct attatccact aataggtgac 58620 

tcaatactca attttttaga aataattaga 58680 

ctcagttcct gagccttcct ctgccggagg 58740 

ttactcactg tgttgcagtc atttttttct 58800 

cctgccagtt atcagtagtt gagaatgcaa 58860 

ttcagacgac aaattgttgg tgcctgcacg 58920 

ctcagtgctc attagcacat gttccagaag 58980 

aaccagctaa gcacttagag aagctctgat 59040 

tgactaaaat tgagtcattt ggattccctg 59100 

ccccatagca acagataata ggattgtctg 59160 

tttttccttt gtcgaacatt ttcacatgat 59220 

ttgtatccag actagttact tctgataagc 59280 

tgacaccaca gttgtctgat cgtttcccac 59340 

cagtgttttg cacatttcac ccagattatt 59400 

aagctgtgat cataatttaa gcctttctag 59460 

tctggcatat gtttgtctat tacaaagacc 59520 

aaatgaaatg ttgcgtgtgt gatgaacctg 59580 

gctgttgtgt ttcacacagt tgttttctgc 59640 

ccagaaaacc ctgctgtgtg ttccccgcgt 59700 

ctgagaaact ccataaccaa atcagttagc 59760 

agcttgggtc ttacatgtct cccaggtcca 59820 

tgttaaccaa ctcatctttt gctgcctgtt 59880 



84 



WO 02/29059 



PCT/US01/31488 



ttttgggggg tttttttggt tttgtttaag ccaagatcag ttatatggcc caggctgagc 59940 

ctctcttccc agcctctcaa atgttagaat tacaagcatg catccctcag catacctttc 60000 

ctttgctttt tttaaaatag agttttgcca tagcaacaga aatctaacct aactaagcat 60060 

agccgtgcac atggtatgag gaactcacat atgtgtgaat ggaagttcat agagaccggc 60120 

atcactgcct agaggcccct ttcttccttc cttgcagttg tcgtgctagc tgactgtact 60180 

acaaaagagg ttgtctgagg cataagacta ccttcaataa aacatgcaca gacagtttgc 60240 

ttctctgaga tttcagagca gtgactacct tcaataaaac atggacagac ggtttgctta 60300 

cctgagactg cagagcagtt tccaaaaatt ttagacaaag ggtaggatga agaaggctgc 60360 

ggggttttgc acacacttaa ggtgcgtaag taaataaact gagctacact gacaggatgc 60420 

tcgttctagt agccaaccaa agagcagttg aaccaaagca cctagactfcc aaacatcgtg 60480 

gggagataat cttaggagtg ctatgcttct gcgtcctaca agtattatga aactgtctag 60540 

aaagcacccc actggtaatc cctttttgat tatttttttt ataaattcta gtcttggggt 60600 

tttgagtggc acacagacat aatggttagg cttcggtgtg tgctcattca ctttgcttcc 60660 

tggggaccag agtttgcgat gagtcatgtt ccatctgatt tctgtcggat ccggctgcag 60720 

agccatgact cagatgggct tcaggcccag ctgctcagtt catcttctgg ggaatagatg 60780 

acaaggacgg gacaaatgtc ctgacgcaca tttccttctg ttcttgcact tccagggtct 60840 

aacgagagca tcattaccaa cagcaggcag atacgccttg ccacaggcat cttccctgtt 60900 

gtcagcctcc tgaaccactc ctgcaggccc aacaccagtg tgtccttcac tggcactgtc 60960 

gccaccgtcc gggcagcaca gaggatcgca aaaggacagg agattctgca ctgctatggt 61020 

gagccagcct ttctttccac taccctgctg tgcctcacac ctcacatgaa aaggataagg 61080 

ggacaggaat cagcagatat gggcccagtg cctctactca tcctctgagt ctttcctgga 61140 

aagggcaatg catccttggg ccaataaaaa aggtcttctg gctgtaataa aaaagcccgt 61200 

tgagggcagt gagccatatc cctccatgcc ttgtagacag cctatcctga aaatgagcga 61260 

ggagcacttt cttggcttct ttcttcctgc cccagcagct tggaaacgta tccactttca 61320 

cccgtgtttt gttgtttttt ctgagatgat agggcagagt acccaacctc atataggcta 61380 

ggctagtgtc tatcactgag ccaggacccc aacccagcac caccatgcca gtcacgtgat 61440 

gactaggcca gcccctcggt agagtaggca ttgactctct- tggtgtgact aggaactgtg 61500 

ggtaatctct ctccagggcc tcacgagagc cggatgggcg ttgctgagag gcagcagagg 61560 

ctgagttctc agtacttctt tgactgccgc tgtggggcct gtcacgctga gacactgaga 61620 

gcagctgcag ctcccagatg ggaagccttc tgttgtaaga cttgcagagc gctcatgcag 61680 

gtaaatctct gctgttccca ggggcagggc tccagctaaa ggttgtcagt cgccaggaga 61740 

accattcctg cttcccttct tgtaactcct ccctacatgt cgcccggtcc tgcagaaaac 61800 

acaggttgta tttcctaata ttttccctat aagtgacaca aaatcttaaa ttacacaaag 61860 

ggaccaaaaa aaaaaaaaaa aaaaaagccc tagaaattta cttgctcaaa taagtcatca 61920 

aaagttgtgc atcaggccta gcacttgggt actggtaacc ctagcactca ggaggctgag 61980 

gaagaaggat ctcaagtcgg aggccagtct caagtgacac cccatctaag agatcaccat 62040 

tccaaggagc tatttcagag atggtttaat ctggggaccc agattgtgga ttttctgtct 62100 

gttcaattcc atctctctgt gctggcctca tcagacacac tctgtagtaa ctgtgggaaa 62160 

atccgaccca catagttttc cctcagcctt tgacccagag ggaagagcca cagtggagag 62220 

catgagagca gacccttggg tgctactgcc aggtaatggt gtagacactg gagtcttcaa 62280 

cattcatgcc ccaatgcaaa atggtctcca caccagagca tggcattctc attagaaata 62340 

agtaaatgga attggctgtg ttgaaaattg taaagccaag ggtcaagaat gaagccttcc 62400 

ccagcatgtt ttgttttgtt ttgtgtttta ggcagcgtct ctctgtgtag ccttggctcc 62460 

tgccctctgc tacctctccc aggtgtgcca ccatgctggg cctaagcgcc ctgtgcatta 62520 

gtgctccctc gatcctgctc actcttgaga cagtcttcct tctactctgt atccccagat 62580 

aacctagagt tcacttcaga gcccaggctg gcctcaaact tgagatcctc gtgtcccagc 62640 

ttctcaaatg cagtgatatt tacaggccta cacctggctt tccctgatag attcctagta 62700 

agatgattat cctttgagcc atatctctct tctgcttctt cctctcttcc tgcagggttg 62760 

atctagaatt tattctaaag ctgactggcc tcagaattgc catccttctg cctttagnnn 62820 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 62880 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnaca tagtgtcaac tttcaaattc 62940 

tgccttaaga gttctttgtt tatgggaatt tatgggaatg ttccacagaa cccatccagc 63000 

ggagttctgg ctgttgtttt ttaatcttta ttcatcttgc gtgtgtgtgt gtgtgtgtgt 63060 

gtgtgtatgc gcgcgtgctc aacttgcaaa attgcaaaat tcagtctcct ctttccaccc 63120 

tgtaggtcct ggggatcaga ctctgttagg cttggtggta ggtgctttac tgagccatct 63180 

tacaggcccc ccatggacaa ctttttcttt gaaaacctgt ttctggcttg ggtgtgatag 63240 

ctcacacctg tgaccctacc accactcatg aggaagaggt aggaggacta acagaattgg 63300 

aagccagcct ggactacaca gtgagtaaag gctatctata tactcaccac atggcaagac 63360 

cccgttttaa aacactgggc aaggtgaaac aaaagtcaat taatttcaca taaagtcaat 63420 

agcttcatta acggcctagt tatctttaaa actgtatgca ggttagtact tggtttcaat 63480 

tttattactt tttctctgga acatttaaaa gtactttagg ggctggagag tcagttaaga 63540 

acagtggctg ctgccaaagg actggagttc actcccaagc acccaggtgg caatcacaac 63600 

tgtctgtcat ctaattctag gggatctgac accctcacag actcacaggc agtggaacac 63660 

caatgtacat aaaataataa ttaaaaaaat gaaataaaat accaggcaag gtggcacacg 63720 



85 



WO 02/29059 



PCT/US01/31488 



cctttaaccc cagcactcag gaggcagagg caggcagatt tctgaattcg aaggcagcct 63780 

ggtctacaga gtgagttcca ggacagccag ggctatacag agaaaccctg tctcaaaaaa 63840 

aaaaaaaaaa aaggacttta aattgggctg gagagatgga ttaaaagcat tggctgctct 63900 

tcccagaggt cctgggttca attcccagca ctcaaatggt ggctcacaac tgtctatatc 63960 

aacgcaatct aacaccctct tcaggcatgc aggttacatg tagacaaaac atccatatgc 64020 

ataaaataca taagtaaatg agtcttttaa tgtatactag aagctgggtg gtggtgcatg 64080 

cctttaatcc cagcacttgg gaggcagagg caggtggatc tctgagttgg aggccagcct 64140 

ggtctgagta aatagagcct tgtacttcta cttatcacta cagttacatt ttataacttt 64200 

gggccctagt gcttccattt tccactgttt gcttaaccac tggggcctga agcttttgtg 64260 

ctgacacttt tgttcgctaa tcatcaggca accaatggtc tctacactcc atcaccatca 64320 

acacaaacaa aacaaaacac aacactacgg atcctggcat ggtggaacat ctttagcccc 64380 

agtacgtggg cttgagttca aggccggcct ggtctacata gcaagttcta ggatagtagg 64440 

gatagtcttt aaaacaaaac actattttat ttatgaacaa aacatgtaaa gaaagaaaaa 64500 

aaactgcaaa tttatctatg aatgaagtct aagtaatact tcaatattgg aaatagcttt 64560 

ctaaaatatt tttatttaaa gaaaactcag caaattattc aaacaacctt ataaacgttc 64620 

gttataaaag taaagaatta tttgcaattg ccttaagggt ccaaggtggc agcctcttaa 64680 

aattcagaac aatccaagct tcacattcca gttcaacatt tctacagccc taacgtattc 64740 

aaatacctcc attctgacaa ctgtttcccc tcttcttttc ttctaagctg cttagatgtc 64800 

tgtcccaggc ttttcatgat tttagtcatt cacacaacta gcaaacatta tctagggact 64860 

aaaacttgcc agatactggg atatcaccct aaagggggac tgaaagtagc tgcaggctac 64920 

agtctctaca atctcctgaa tgaaatacaa agtagctaat atttaccaaa taaacatgta 64980 

cacctgtgat gattgctagc tgtactagca gaagctaaac actaaatcta gaaactcagt 65040 

cctccaacta gccccttgct cggcttcagc ctcattttta caaacaaggg aaagagtttg 65100 

gaatgttgcc caaagccata cataagtgaa caaaaaggag ttggagtctc caaatgcatg 65160 

gatttgggct agttactttg ccaaccaact cagtaacaac tgagctgaac aggaacactg 65220 

tggtagcaaa agaaactgga actatcaatg gcctctagag caaaaatata tttaaaaaga 65280 

aaaaaacaaa caaggcctgg caaggagact gtgagaagag tgtgctgact gaaattgact 65340 

agttcagcca acaaaagact attccagggc tggtgagatg gctcagtggg taagagcacc 65400 

cgactgctct tccgaaggtc aggagttcaa atcccagcaa ccacatggtg gctcacaacc 65460 

atccgtaaca agatctgact ccctcttctg gagtgtatga agacagctac agtgtactta 65520 

catataatca ataaataaat ctttaaaaaa aaaaaagact attccagtgg ggatggaaaa 65580 

gttaagtgtg gagttaaaat atacttcaac tggtgatgga ctaggtgtcc agagtcgggc 65640 

aaaaggatgc tctgtggtag aggtgcctgc tgtgtaagcc cagctacctg agctcaatcc 65700 

acagaatcca cagcggagtg ggaagagaaa caacgtccca gagttgtcct ctggcatccg 65760 

acgcacattc gccatcccca agatgtcata catatgtgta catactacac actggcgcac 65820 

gcgcacacac actctttttt aaaattcaga cttagaggga cataaaggat ttgctctgat 65880 

atatgttcaa ttgaaaatga ctttgaagat agagggcaga tcgaaggaag ctcagcagga 65940 

aagaattaat aacatgcagg tgaagggcta taaactagtc tgcagagggc cttggctcga 66000 

caaaaaaatc tatggggttt gccggtaaaa taaggaaaaa gttgtcaaca tgaaacacag 66060 

aacactagca agagaggagt gttagcagaa agaagccaac aagctcaaac aattaggtcg 66120 

gctgaaaaat tttaaaatgt cttctgattt ggctactggg aagccactgg tgacttcggt 66180 

cagcgttttc tctctcgtga ccagagagat gtctagtagc aataatgagt taggaggatg 66240 

taaaagaagt aaaacagccg aaaacaagtc caaaaagttt ggggtgatgg agaaagggag 66300 

gaaacagagg ccgccgaaga tagacagcgg catgtttatt tgtcttgttt tcttagatgt 66360 

aaacaaacta aaaaaactcg tgagttcttc tgccagtacc gggttgcctc cagcatcctc 66420 

tgatggtctt agagaccccg ggatgctccc ccgcggccgt ataatttcct ccctgacgct 66480 

ctcccgatcg acagcggctc cctccccggg tcctctttgc accgctccaa ggccgcgctg 6654 0 

ctagggccat cgagcccgct cagggtcgtc tccttacctc gatggccccc tcgctcaggt 66600 

gtcccaccat ggctgcaccg ctaactcccg cgctcgcgct cttgcaccgc ctgagcttct 66660 

ctgccggggt cccgcgggct gctcaacgat tggctagagc aactgtgcgt gccgatccgc 66720 

ccccagcgtg agcgcggtgc gaggggcggg cctagacgcc gatagccacc gcattggcta 66780 

ccgcgcggca ggcagagcac gtgactcttc cgaggccggg ttcgaggcct agtggcggga 66840 

tggcgggacg tgagggcggg gcgctgggtc gcagtgcgcc tgtgtcagcg cggtgctact 66900 

gagttgttcc cccgccagct gtcggaactt tgcccgccca gtcctttggc ggacagacag 66960 

aatggcaacc cagggaacag tcggagctct cccctggtaa ctgctgctaa atatagtcaa 67020 

agcagtgacc tgggtacttc ttcacgcagt gcgtgcccgg cgccggtgcc aggcccagag 67080 

cttggcactg tgggataaac aaggtaaatc agactcagtc tccgccctct tgagttccac 67140 

ctgagagttg tggccgcaag gaacccagcc tcaaggatgg tagacgcgat atgggccaca 67200 

catgtggagc tccagagtgg gggtcaaaaa tcaatcaggc tttcgagagg cgatgcggtt 67260 

tgaactgagt taaagtgtgt gtagaaattt gtcaggtgga ttccagtgag gatagtgatg 67320 

ttcctaaaag cccaaatggc ctatgcaaaa gtattggaga gcctggcgtg ctggctggct 67380 

ctgatctgtt tgtaatccca gcctttggga tgtagaagca gcaaaagttc aaggtcaccc 67440 

ttaacaccgt tgagttcgag gtcaacctga actaaatgag accctgaaaa atcaaaattt 67500 

gggacccagg cgtggtggca ttcgaggtaa aagcaggcag atctctgagt tcgaagccag 67560 
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ccaggctaac ataagatccg gtctcaaaaa 
ggctatatga actgaaagaa agacctggag 
atgaaaatat ttaacttcat agttgctgga 
aggtaaacag gtctagaaag actggaatag 
tgtacaacca caacctacta tagtttctca 
tggtaccaca ccacagatta actctccttc 
agaaatgatt ttggttgcgt gtcacttggt 
cattaataga aagctgaatc ttcatgttct 
accggtggat gaatggaagt cctacctact 
gcaggacaca atttctacag cagagacttt 
tcttcagtaa gtgaatggaa acttcaggga 
catttggtct gaatatctct tttttatagg 
gggcatttag cagttactgt tggttttcac 
ggtaggcact aactgctttc ctcttttggc 
gctgtcgtcc cagtgctgtc actcatagga 
agctcattca gtcctaatgg ggagaaagaa 
cttctcaagt tctnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
tccaagtgcc actttactgc aatgtgtcac 
tctgtttttg agacagggtt tctctgtgta 
accaggctac cctcaaattc actgagatct 
cactaccacc acccatccag ctttctatat 
ttatacgtgt gtttatcatc taaagtcctg 
ggtgaaaagg attacaaacg cttatcaatt 
tagctgcttc tttcattaga attgctatca 
attgaagaag tctcttcctc atcaggtcag 
cagtattgat aactgttctc gcttaggaat 
atgtgctgag ctgtagcaac gaatcttgca 
ctcgcctgca ggaccttcag caacaggtct 
aaccaggtga gattggctcc ctcgccctag 
tcaccttgcc ttactgagca gtcctcaata 
agcaccagcc acaagatggc tatgtaggcc 
ctggtgccct tgaaagtctt cctgatggtt 
agcagctcga gcaccagaaa ggctgtctga 
ccaatgccat catctgattc ccagcttaag 
tcagcagcag cagcaagtcc tgagtgctgc 
ctcggtagct agcctgagct gagagctgag 
gggatttgca gaactcgatg ttagaccctg 
gcctagattc tgaagttgac ctttagccta 
atgtagagtt ggttgttaaa tgttggttgg 
tgaggaagcc tcatctaaca cctcttttac 
tccctggaat gcccggctgt tgactccgtg 
ccgatagaga ccttgcaacg tggtctctcg 
ggcatttttc ctttgaatcc atagagcaag 
ctgctgagag cttcttgtca gcagagcaca 
cccaggccca tgctacctta ggtatgctac 
ccgtgtgttc agtgcggccc ttgccttgtc 
cgctcttctc cctcccattc tgcattcctt 
ttttccggaa tgagacattt ctcttctcac 
acagaagagt ctccaggtgg ttgaaactcg 
tgagctcttc aaactggccc aagtcctatt 
ttggccaggg cacacagtgc tctgtgtttt 
acagtgcttt aaagcatgtt gaaactcttt 
ctgcttcact gatagacttt aagctttgaa 
aggtctgcat ggagcttccc agggaggatt 
ctgtctgctt aatcacacct cccttggctg 
tcggcccttt ccctggtttg cctcttattc 
gattgcaaat ggcatttggc agtgtttaga 
ctagtaaagg ctaaacctac agtgcacagg 
gctatgtgaa ggctgagaat ccctgctttg 
tcagttggaa gcacccttgg tggcatgcac 
aaaaaataga atacaaggaa gagtaaccct 
tgaggtcact tgaagggaag tctagcccca 
cagccttgat caaatctcat ggtatacatt 



aaaaaagtaa taaaaataaa aagggagaga 67620 

atcaaaacag aaaactgagc cgtctaagaa 67680 

gtaagaagtc tggaaaactt tgggcaacta 67740 
tagccatcta ctggtatttt gatctctgtt • 67800 

aacagttcca aagaatatgt ctgggtgaat 67860 

agcatatcaa cagctataga aaaccccaga 67920 

aggatgaaat ctcgattttc tagaactatg 67980 

gactttacag agctgcggca gcatggatct 68040 

taagaagtgg gcttcactcc cgaagtctgt 68100 

gagcgacatc ttccttcctt cttcttccct 68160 

aattttggtc tggaaaatgt tctgccttgt 68220 

agagagtagc tttatattct ttatagtatg 68280 

gtttctccct agtctgtgat tactagaatg 68340 

atgtgttata cttaaggaat gtagtatctt 68400 

tctggtgcag gttgtgtagc tgcccctaga 68460 

ccctggcact tggttagttg agacccanaa 68520 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 68580 

nnnnnnnnnn nnnnnnnnnn nnntttagtt 68640 

cacatccaga gttctgtgtt tgtttatttg 68700 

gccctggctg tcctggaact cactctgtag 68760 

gcctgcctct gcctccagag tgctgctgta 68820 

tggttttcct atggcgtttt aaaatagcca 68880 

gtcccaaaag gagatgagag gggctgctaa 68940 

ctgttcaaaa attaaacctc agagtgggat 69000 

gaattcactc aggccttgtt tgcgtgtgtt 69060 

tgactcctta gctcaagtac atgcaatatg 69120 

aaaaatagaa- ctgacttcca cagggaaatg 69180 

caaactctgt cagcagggac cagctagtct 69240 

gcatggccca' gaagcttctc cgaaccggta 69300 

gcctcagccc ttcccttgtt tattttggta 69360 

aatgactgag gacttgaatt taattatccc 69420 

agtgagacca gactgtgacc agctgttact 69480 

taagctgtgt ctgctgcgcc agatagttct 69540 

cttccatggg ctttgtgtgg ctccagaggt 69600 

gacctaagct ccgagaaggt tgctctgccc 69660 

ctgggctcgt ggtgtgactc aggagtagag 69720 

agaaagaaag gactcctctc ttttcagaaa 69780 

acatggtagg aatctgtttt gactattcta 69840 

gagtcaagaa aactaatgat tacaggagga 69900 

aaaatggatg ttagaagccc agggtaaatg 69960 

tgaaagagaa aacataagca accaacagct 70020 

agataaagag gcattttcac tttgacctaa 70080 

tgtccaggac tagatctgta tctgttgtga 70140 

ccattcagca gttgttgcgg tgccgggagg 70200 

ccgtactggg ggaaattgaa gatggcctgg 70260 

cttaggtata gccggagttc tccttccctg 70320 

tgtttggttc tctcttgcca tctgaattga 70380 

gcccccagag ccttaggcta atggtgtttc 70440 

agggaactgg ctaaagtctg ctgcccatgt 70500 

ccatgggcca tccagtgttg aaattggcca 70560 

caatgggtag gcctttcttt ttcctagtgt 70620 

cctaggtgct tctgtgtatg gctttttgct 70680 

tatttcctct ttaggacaga tatttgccct 70740 

ttccttcctg aggatgtgga gaaagccatt 70800 

tggaggcagc ctcacccgcc tctagcattc 70860 

cctcagtccc tgctctctca actccagggc 70920 

cttttaaagc agtggttttc aactagaagg 70980 

gacagttttg attgttatgg ctgccagcat 71040 

accgcctcca cagtggagag acccaagtta 71100 

gagattaaaa aaggaagctg agggaaccac 71160 

aaggccctgg ttctgtccct agctctgcac 71220 

aatgagctgg tccctcaccc agtgtgccac 71280 

atttagtatt ttttgtggct gccatacctc 71340 

ggtaagaaaa agggtttgaa acatagacct 71400 



87 



WO 02/29059 



PCT/US01/31488 



gatactcgga catggaaaca gtatgtttgg 
ggcaatatca gagagagggc atcagtcggg 
ctggataagc ttttataaat gctggttaag 
ccagagtcta tcacaaacat acacagtttt 
tgtctcttat ctatactttc agatggtaag 
gtgccccctg taagaccctg ggggatgcag 
aggagtccac tttcgttgag actaaggaag 
ggttggcagt gcctgaagct ctgagtgcca 
actgtggccc tgagagtgag gaggtccggg 
actcgtcatt cgtccctgtg gggcccttgg 
gagctctggt gatgactgag atgttctgtt 
ccttcccacc tgacctgagc ctagtgtctg 
cactcacctg ccatgctggc tgctgcttgc 
atgcccagaa aagactaaat ccagtatcta 
cgggcttggt ggcacacgcc tttaatccca 
caaggccagc ctggtctaca aagtgagttc 
tcttgaaaaa ccaaaaaaat aaaaaataaa 
aaagggttgt cactgatctg caggcagctc 
tctcactaaa cgatgaatct gtttccctgg 
gtgctgtgtt ccactccagt gagaacttca 
aatatatagc aagaaatcag tatgactgct 
atatcctgct tccttaaatt tttcattaag 
ataaatacgt cttactgttg gcagtttctt 
accagtgtcc taattcttac tcagtggctt 
agcactgatg caaggcaacc ctgtagcatg 
acctgaaagc agaccactct ggctgtttca 
gtctagcctc aaacaacatc ctcctgcctc 
ggccaccttg cccagcttga gactgcttct 
aagtccaggg cactccagga ggagccgtga 
atgagtctgt ctgcagggca ctccaggagg 
gggaagccgt gagtctgtct gcagggcact 
cactccaggg ggagccatga gtctgtctgc 
ctcattggta gactgtgagg ctacatctcc 
tgctgctcat ggtttccaac tgagacagag 
ggtagtaata gtccagacca caagggacag 
gtacattatt gctgatgacc agtttgtttg 
catacactct gctttttaga acttgctcat 
tgtcatggca ttacagtgag gaaataaaca 
agaagctggt gttaaaagac agtattcaaa 
gagtctcctg ctgtctgtta atttaccctg 
tttggggtag agctttttca ggaaaaaagt 
tagtccaaac tcatgccttg tgttttattc 
gtgctcttcc tcaaaggata tatttaacgt 
aagctttttt tagatgcttt gtaagatggc 
ggaagtcctg ggtttgattc tcagcagcta 
tgatgccctc ttctggccta tgtgagcact 
caaagcactc atacataaaa ctaaattcaa 
cttgagaata cctgtgctta tattataatc 
ctattatctc agcactcaga agattgagac 
aaaacacctg tctcaaaagt aaaagggctg 
ctgcttctcc agagggcctc atttcagttc 
tgtaactcca ggtccatgag atccaacacc 
attatacaga gagagagaga gagagagaga 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn ntctagagtg tttcaggttt 
ctctctatta tgctgcctgg aactttctat 
atccactact tctgcctctc agtactggta 
ttcaagatct tagatttcca aagaagccgt 
aaaacagaaa gaaatcaggt acacttagag 
caagttgtca aaactttgct cttaattcta 
gggctgtagc caacttggtc gagcccttgc 
tggggtttga attgatttga acccagtaga 
aatgacatat tacaaggtgt ggccttgttg 
aggtctctct gctcaagctc tgcccagggt 



tcagagagag cgaaggacct gatagacgag 71460 

ttagacacga gcattccaca gtgagcagct 71520 

gttttgaatt tgcccaattt tgtcaggatc 71580 

ctcaaatctg ctttgcagta tgcccgtgaa 71640 

accctgaggg cagaggaact cagacccttt 71700 

tggacccgac tttgtgttct ctgcacagaa 71760 

ggaactgaca agcttccctt tctggcttca 71820 

tctggaaggc agaaaggatc ctgttggtgc 71880 

agctccggga aatgaggtcc tgcttactgg 71940 

tgtagagcaa tcatcctcac cctcaagaag 72000 

ggcttggagc tctcatcaga gaggacggga 72060 

gcacagagag cacttgaaaa cagattgaga 72120 

aagagctaac tgccctctga tggaaacccc 72180 

aaggctgctt taaagggttg tcactgcagc 72240 

gcactcggga ggcaggcgga tttctgagtt 72300 

taggacagcc agggctacag agaaaccctg 72360 

aataagtaaa aaataaataa ataaataaat 72420 

atgctagcct aggcttttgg ctcgatttca 72480 

aacattccta tggtttctag tagtaatgaa 72540 

attcttagtc ttgtattata attgaaaaat 72600 

tacctcaaga gacatacaat tccacttaca 72660 

actggtgata tataatttgt gaatggagaa 72720 

cctgggatgg caactctgta ttggtttcct 72780 

tcattgagtg ttcttggcac tcactgtcca 72840 

acttcatagc acaggcctcc ttgttagcac 72900 

cttgcagaca gaatcttact ctgtaagcca 72960 

agccttccaa gttctaggtt tataggaaaa 73020 

tactgccatg tctcttcagg ctcacacatg 73080 

gtctgtctgc agggcactcc agggggagcc 73140 

agccgtgagt ctgtctgcag ggcactccag 73200 

ccagggggag ccatgagtct gtctgcaggg 73260 

aaggcattcc aagagcagcc atgggcgtca 73320 

agatgccccg agtgctgtgg ttgtgagcac 73380 

ggaaggactt tgcccctttc cctaaggatg 73440 

atagctatgg ggttttctga ctcatcctta 73500 

gatgagttag tgggaaagaa gacccaagtc 73560 

cctagccatg cccaaggagc agccgttgac 73620 

gtcctgaagg tgcctggcag cagcttttca 73680 

catctgcgga ctgggaactg ggcagcattt 73740 

acaaggaggt gacttgaaag gtttgttttg 73800 

ttagtcctac agacaactct atagttattc 73860 

ctaaaagccc tgtcacactt tgtaaaatag 73920 

tttatatatc aggccttatt ctgtgcatgg 73980 

tcagtggtta agagcatgta ctgctcttct 74040 

acaccagctg ttattccagt tcctgggatc 74100 

gcatgtgcgt agtgcacaga caaatgcagg 74160 

aaaactcttt cattgtctca tgtgacctag 74220 

tagtatgagc cagccacggt agcaacacac 74280 

tagatggtca agagctagag tctgggttac 74340 

aaaaagtgtc tcagcagcta agagcacaca 74400 

ctaataccca caccgagtga ctcaaccacc 74460 

tctggtcgtc tgcataagct cctacactca 74520 

gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74580 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 74640 

tttttgtttt tttttttttt gagacaaggt 74700 

gtagaccagg ctggactcaa acttatagtg 74760 

ttgaaggcat gtgtcaccac accccactac 74820 

agcctagaaa aggttaataa gtactgattt 74880 

ctgtagaatg tcagcatgtg acatttgtga 74940 

aagagagaag ctgtcaaaag acttgaactg 75000 

atgaagctgt gtgtttactc cccagcactg 75060 

ttcgtatatt tgaatgttta cctcatgggg 75120 

gaggaattgt caatttgggg gtgagctttg 75180 

agaaagggag cctcctcctg gctgtctaca 75240 
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gaggacatag tctcctggct gccttcagat caagatgtag aactcttggc tcctccagca 75300 

ccaagtctgc ctgcacaatg ccatgcttcc taccatgatg ataatgaact gaacctctga 75360 

aactgtaagc cagccccaat taaatgtttg tctttataag agttgccttg gtcatggtgt 75420 

ctcttcataa caataaaagc ctaactaaaa cacattcctg ctgggcagtg gtggtgcacg 75480 

cctttaatcc cagcacttgg gaggcagagg caggaggatt tctgagttcg aggccagcct 75540 

ggtctacaga gtgagttcca gaacagccag ggctacacag agaaaccctg tctcaaaaaa 75600 

aaaacaaaaa caaacaagca aacaaatgcc agcatttggg aggtagagtt aagaagattg 75660 

ggagtacaaa gtcgtctcag ctagtatgtt tgaggccagc atggaccaca tgagacgttc 75720 

tcaaaacgaa agaaacgaat gaatagataa acatttgagt gtccagtttt ttcctttctt 75780 

tcttgctttg tttttggcgg tgctgaggat taaacccagg accttgttca tactaggcaa 75840 

gcattctcca ctgaggaaca ccctggcgag tgcctagtct gtctgtctgc ctgcctgcct 75900 

gcctgcctgc ctgcttgtta tgtgtatgag tggtaacctg catgtctgtc tgtataccac 75960 

agacatgcct ggtatctgca gaggccagaa gaggatgttg gatcgcctgg aactgggatt 76020 

acaaatggtt gtaagctgcc atgtaggtat tcagaattga acctggtgct ctgaaagagc 76080 

agccagtgct cttgttgttg gttttattgg gggcaggagg tagttatttg gttggttggt 76140 

tggttggttg gttggttggt tttcttgaga cagggtttct ctatgtagcc ttggctgtcc 76200 

tggaacttgc tctgtaggct caaactcaga gatctgcctg cccctgcctc ccgagtgctg 76260 

ggataaagtc atgtgccacc aactccagac aagcagccag tactcttaac cactgagcca 76320 

tcattccagc ccttctttgt gttttgagat ggtcacaaag tacaactcag actgagctct 76380 

tgatcaccct ccctcagcct cctgactgct gggggttaca ggtgtgtcac tgtcctcaat 76440 

tctgagtgtc agatcttgaa aacccattct cgtgaccttg atccttaaaa caaaccctgg 76500 

gagaatgagt tctgataact atttctcact cctcttcaag aaaaggaaag ccagagaaag 76560 

99999 i 3 ( 3 ( 3 ( 3 c aagccccaga aacattgata acttgcccaa agttacacag caaaattcag 76620 

acagcctgca catctcagtg gccatctgtg ccatatccac cctgcccttc tctgacctcc 76680 

ccacctccat ccctacagac cttgcagttg agatcagagt ccaagccgta tcgtaagatg 76740 

gccttaggat ctgacatcat ggggactctc acggtcctgt cctcgtccaa atgaaaatcc 76800 

tggagggtcg tctttctcga gtcaaacttg gttacccact gccctggaag gaaacagatg 76860 

gagcatcctg agccactgtc cccagaaagg ccacaggtcc acgctgtgcg tccactggcc 76920 

aaagcaacct gagctgtcag cagcaagaac acaggagccg ctgggtccca gcatgtgtgg 76980 

cacagaccat aggctccatg caccacgggt tctggctatc ctcctgtagt aaactcagaa 77040 

ataagtgggt gttctctctc tgacttggat caccacgctc cttctgttta aagtggcctt 77100 

taatatgctg gtgtgtggta cgtgcctgct ctcctgtccc ctggggactt ggagtaggaa 77160 

gccccagggc tttcctctaa gttaggatcc actcttgcta ctactccata agatggtcac 77220 

aaagcaacgt aaaatggaaa ttaatcaaac cattcctgcc acaagaataa aacagatctc 77280 

aggggaggcc tgtggaaggg tctcctgagg ccttaccact gtctagaagg aagttgacag 77340 

cagttcttga gcaggggtgc gactccagga gttgggggct gctctgagag caggacagca 77400 

tgtattgtag agtgtctggg agggagctgt gttatcctta ccgtgaagat gaggacacgg 77460 

gctcatgggg gcagagccag gattaaacct ggtctgattc aaaaagccag agatctgtgc 77520 

ccagccccac gcagccattt cactggtcaa ctaattcaga aacacttggt ctgatatgct 77580 

catatgctac aagcactgtg gccttcagat ctccctctgg cctggtacct gcattcaggt 77640 

tcaccaccat caccacacac acacacacac acacacacac acactcggcc agagacaagt 77700 

ggggaagccc tcacccttga agtaagccac gccaaggaga aggatgctga gggcactggg 77760 

catttccctc gtggaccggg caatcttccc tttcatctgg gcctgcaccc agttgttaat 77820 

ctcctgaagg tctactcgag ggttgcccgt gaggatccgg ggcctggtcc cataggactt 77880 

ctccagaggg gcaacaaagc tggatttgac tcgaagttct tgagaggaaa cagatcaaag 77940 

atgagagctg aatcagcacc ctcactttga aagcatgcca gaccccagct tcctgctcag 78000 

catcttcctt tgacttgctg gggcatctgc cggcttgccc agaccctggc tagggaacag 7 8060 

tggattccac cgtttgcatt ccccgtccca ggccctcctg ctgtctccca gagcccactt 78120 

cctctttctg ttcctctgtg gtctcactgg ctctttcctg cccaccagtg ccaggcctcg 78180 

cctgagcaca cacagcctat tgtttagaca tcatggaagc atacagacaa cccaggccaa 78240 

tgaagcaact tcacgccagg cataatgggg cgtgcctgcc cttcagaagc agaggcagct 78300 

ttatgagttg ggggaccagc tgagactcta tagactttga gaggtggggt gggggtgggg 78360 

ctactgactc ctctcaaaca caattctgga agcactcttg aggttcttct caggggcagt 78420 

aacagaggca aggagctcct tgtaggtgct gtggatgtca gggttggtga tcaggtcgta 78480 

gtagagagcc cggtgaatga cagactctgt tcgatgttca gctcctgcca gagagaaaag 78540 

gatgccaagc ttcataactg cccgtgaggc ccgcatcagg atagggacgt tagacatcaa 78600 

tccttttgtc ctctgagagc ccgaggaggc cgatattgca gatgttttag ctggacaaga 78660 

tcttcagggc gtggaaagaa ataatgaccg ccttgctagg aagagctcta agacagggca 78720 

aggttatcag agctacagag agaagagtgg gatgtggtcc tgaagttctc ccatcgtaac 78780 

ctaccctgtt ctgaggagga gccagctctg ctcacggcag ctgtacccct agaacctggt 78840 

taaatgacta aaacacgata ggaggccact taaggaacca aggtcgagtg ccacttacaa 78900 

agtggtaggg attgtgtgtg tggcccccac cgcccctttc ctgttcctct gacggcggca 78960 

gcatggaaac tctgagtggg ggaaattcag gtccacctgc agccttcttc agttgacact 79020 

cacccagaga aagggcagag agggccgtgg ccacgctgag tggagacagc aggac'gttgc 79080 
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ccgttgggct 
ctgccagctt 
tgctgtccgg 
agtacatcac 
gaatgagtgt 
gcaacacaaa 
tttaaaataa 
gttcaaggac 
acactgggga 
tcagcgtatt 
ttgactacta 
cacccctctc 
aaacccaccc 
gttctgctgt 
gaggaatttg 
gcttttttgt 
aatgcttggc 
tttttagttt 
gacaggattt 
cttagattca 
ctgtacctac 
gtaacatgtg 
gaataagtga 
acttaaagcc 
cccaccccta 
tattcttgtc 
tggatccagg 
acctcagagc 
aggagtagca 
ccctcaccca 
gcagagtgag 
tcttccgtcc 
atcaggagac 
tcaggcccca 
cggccagctc 
atagacttac 
ctatgttaaa 
tagccttgaa 
tgagccatca 
gggtatgaac 
nnnnnnnnnn 
nnnnntctta 
tgtgtcttcc 
gagggggtag 
taaggaagtt 
gtgagatggt 
tctgtgggta 
tagtggggcg 
caaggggctc 
actcagtagc 
acgccatccg 
gccgagagat 
aggccaggaa 
gggccccttc 
ctgacctgcc 
gtttcgggga 
ctggatgtgg 
gctggtcggg 
ggcctggaag 
gggagtaccc 
ttctccaggc 
tgggcacggg 
aaggccttgt 
tgctggaggt 



ggcactggat 
gttcacaggg 
gactggggag 
ctcctggttg 
gtgctctctg 
agggcttgcc 
aacttgtagg 
ctgggttcag 
ggcagagatg 
ccatctcagt 
gcctactctg 
tgtccccttc 
tcagaccagt 
gacgccctcc 
tcgttttctg 
tttgttccca 
cacccagctg 
tctgtttggt 
cgctatgtga 
gagatccacc 
caacctcaaa 
tgagggcctg 
ctggctagtg 
cacaatccag 
cttcctccgt 
tcgatgttaa 
gctgcctccc 
tgctggggac 
ccagggcctg 
caccctcccc 
cccttcccct 
ctgacctgcc 
ccaccttccc 
agacaatgct 
ctttggtagg 
agaacatctg 
gacacacaca 
gttgcagcta 
tgcctgactt 
cagggccaca 
nnnnnnnnnn 
tttttagctt 
atttgcttct 
gactcaagaa 
acaggtgggc 
gcaagatttg 
aggatgacca 
gagccccaga 
ggagactagt 
cgaacttgat 
ttccatggct 
aggggcgtgg 
gccccaggca 
gaggtcccgg 
cagaccatgg 
ctgcactggg 
cggaacattt 
tgcggaactt 
actgaagttg 
agtgggagaa 
tgtggcagtt 
tcctagtagc 
tggaccggga 
ttgctgtgag 



ctcaggcggt 
accttgaaga 
ccctgggtta 
ggttggtcct 
actccacaac 
agacctgaac 
gctggggaga 
tccccagagc 
ggcaggtcct 
gaggggtcct 
tgtctgtttc 
cctctgcagg 
cttccctatt 
ctgtctttcc 
acctgttttc 
gggctttgga 
tacctcccgt 
ttttggtttt 
ctctagctgt 
tgcctctgcc 
ggaagtgtcc 
ggcttcaccc 
gcaagacagt 
gtctaatatg 
atagtggcag 
agtaaccaag 
tgcactgtct 
gttctggctg 
catcccggaa 
tgccactgca 
ccctccagct 
cctgcacttc 
atctaagctc 
ctagcacaaa 
atttggaacc 
gtctggtcct 
cactttcttc 
ccctacttca 
gactcccttc 
ctgccnnnnn 
nnnnnnnnnn 
gtttgttttt 
tctacctaat 
gggattctcc 
agagctcggc 
ggccctcagg 
gtaggtgaga 
ctagtcaggt 
ggtgttggag 
gccagaaagc 
tcacccgagg 
acaagccagg 
cagcagggtg 
gcaccgggtg 
catcctgggg 
ggctgcagct 
tcgtttcgct 
ggggactgac 
gaaccttttg 
tggatctagg 
tcctcaaatg 
agtgtcagtg 
attctaggtt 
gaattaagga 



acagatcgta 
aggggtcctc 
gaatacaagg 
ctagtccctg 
ctgtgtgctg 
tgtggtctga 
tagcacagta 
tgggcacggg 
ggggctcatt 
gtttcagagg 
tgtctgtctg 
gaacttcctc 
cagcttgctg 
tgacaagcca 
agtgtcagcc 
tcaaagctgg 
ccctgttgct 
gattttttgt 
cctgggactc 
tcctaagtgc 
atggatagag 
ccaacagaga 
attgtctaag 
cccatctgcc 
ctcctctgga 
gcctagaaga 
cttctgctgt- 
ctgccgtgcc 
ctacaagaga 
cattgcacac 
ctcagcccca 
tccttgagct 
cagcacaggg 
cacacaccaa 
ctgtctcagg 
gctatccacc 
taggtaggtt 
tttgcctcct 
ttctatttca 
nnnnnnnnnn 
nnnnnnnnnn 
cttacttgag 
aaagtttctt 
cataagctgt 
tctgcctgac 
agttggctct 
gatgagggaa 
gggggtaaac 
aagacctcta 
cccaaactgc 
tggccccatg 
aaaagttaca 
gaaaagctag 
ccaggatcag 
actccttgtg 
gctgcccacc 
gatacacagc 
aaagcactga 
gaatggaact 
tctgggagaa 
gtcaccgacc 
ggtgagtgta 
cctcccccat 
cctgagaaaa 



gccgaagttg 
ctcctccacg 
accagtaggg 
gggccatgag 
ggaggtgggg 
gaacctgaag 
gataaagtac 
ggtgcatgct 
ggccaatcag 
gcctgaggaa 
tctgtctgtc 
accaccacca 
gctggtccta 
tcccctctga 
tcttccttat 
gctcttacat 
tttcggtttg 
tgtttgtttg 
actatgtaga 
tgggattttt 
ttcagtacta 
agggggagtt 
gtcactaagc 
ttgtccttgt 
tcctgaaagg 
gtgaaggcca 
cccacctacc 
cgagcagggc 
aacaagagag 
tgcacaggga 
agtggaccct 
gtgcccccat 
aagacccagc 
ggcttttccg 
atgggagcag 
aatagttctc 
cttgtgtatg 
gagtactaca 
agagcaattc 
nnnnnnnnnn 
nnnnnnnnnn 
acctggggag 
ggtgatgttt 
tccgtttggg 
tggcgtgctc 
gttggccctg 
ccagaacaga 
tagatgactt 
atgtgttgta 
taaacccaag 
gctgcgccaa 
gcacgctgga 
atcccgattc 
agaaactgac 
gctggcgctt 
ccgaaatctg 
ctactctctg 
ggggcggggg 
ggtttgggtt 
attgacctta 
caattaatga 
cagaaaaggc 
ctttggaatg 
gtgggacttg 



gagacagctg 
ggctcgcccg 
aggcacagtg 
tctgaggtca 
agtgggaagg 
cctggcccac 
cagcatgcaa 
tataatccca 
cctgaactaa 
tgactctggg 
tgtctgtctc 
acccccaaag 
gtctgcctag 
ctagacccga 
gagactttct 
acgttaggca 
gaggactttt 
tttgttttga 
ccaggctggc 
agattttaat 
catcatgtgt 
agtaggtgag 
cttaagccac 
gtgacatgac 
agagggaaga 
aagcccaccc 
caccctactg 
tccagtccag 
caaacgactc 
caagagtcag 
tgacttgagg 
gttggttcct 
agcaggctct 
tggaggcaca 
agcccaggtc 
tgaccaaagc 
tagcccaggc 
atgccaggtg 
ttagttaaga 
nnnnnnnnnn 
nnnnnnnnnn 
gggaggtgta 
gggggggggg 
tagtactatg 
tgaggtaaag 
taccttctgg 
aggtgaaagt 
tctggaaccc 
aggcctctga 
caggagcggg 
tcaatgagca 
aagataatac 
tgccggaggg 
tgaaacctag 
ccttcacggc 
ttcgggaccg 
gagtaggggc 
tggaaaagag 
gtggatgggt 
gctctttgtc 
tcacccaccg 
tgaatcggga 
gagcagatgt 
agatatctag 



79140 
79200 
79260 
79320 
79380 
79440 
79500 
79560 
79620 
79680 
79740 
79800 
79860 
79920 
79980 
80040 
80100 
80160 
80220 
80280 
80340 
80400 
80460 
80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 
81180 
81240 
81300 
81360 
81420 
81480 
81540 
81600 
B1660 
81720 
81780 
81840 
81900 
81960 
82020 
82080 
82140 
82200 
82260 
82320 
82380 
82440 
82500 
82560 
82620 
82680 
82740 
82800 
82860 
82920 
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gctgtgcatc agctctgagc gaggagcctc atagtcttct ccggtgcctt caggttattt 82980 

cgctgcagat ggagttgata tgctgtggaa ccagacattg gcccaggcct gggaccttct 83040 

ctgtcaccat ttggcggtaa gactctgaag ggagaggcca ggtagtaagg gagcatgtcc 83100 

aactcaaggg cccaacctct ctcttcagtg ttctgtcctc tgacttttcc acaaagcccc 83160 

ctgaaaacct atcctctcag acttggattg agttggaggg aggttttgac tggctagcca 83220 

ctcctgggca ctgcccaagg agtttggttc tccccacaaa cctccagctg atcataaaaa 83280 

aaaaaaaaaa aaagccagga atgaaagcta gggtatgcta tgcaaatagt gtggcttggg 83340 

gtaagagaac ctctggtcca gggctgctca tgccccctag ataagggtca gcagaaaggt 83400 

caggattgga ggcagtccta aaaaatgctt gggtaatata aagtgaataa ataaaaaata 83460 

aataaatact aatttttaaa aagctgatac ctggaaggat gaggcagaga gtagaaaaaa 83520 

catgcgtggg tgtccctagg ataaggagct gggacttgtt gggcacaggt catgcaaagc 83580 

ctgaaccttg aaccttgcct gcaggtagtg agctgcctca gcaccgctgt tgtgtctggc 83640 

cactatgtgg gcttctctat ggtatccctg cttctggagc tgaactccat ctgtttgcat 83700 

ctacggaagc tactgctgct ctcccataag gccccatcct tggccttcag agtaagcagt 83760 

tgggccagcc tggccaccct ggtcctcttc cgccttctgc ctctgggatg gatgagtctg 83820 

tggttgtccc ggcagcacta ccagctgtct cttgctctgg ttctgctttg tgtggctggg 83880 

ctggtcaccg tgggcagcat aagcatctcc acagggatcc gaattctgac caaggatatc 83940 

ttgcagtctc agccctaccc gtttatcctc atgcacaagg aaaccaagac acgtgagcct 84000 

gttgccagga acacttccac tctcagtctg aaaggtgtgg aagttttctc ttctgtcagc 84060 

ccccagggag gtggggctgg gaagaggaga tggtagccca ctgcatagtc tactatgtag 84120 

caaggactag actgtatcat cagagagaga gagagagaga gagagagaga gagagagaga 84180 

gagagagaga gaacattgta tgagatctcc attacagtca ggaaatcagg agatctaaat 84240 

aactttaaaa gtcccacagt ctttacatat tcttaaaatt tcaatctctt taaaatatcc 84300 

atctctttta aaattcaaag tctttttaca attaaaagtc tcaactgtgg gctccactaa 84360 

aacagtttct tccttcaaga gggaaaatat cagggcacag tcacaatcaa aagcaaaagt 84420 

caatctccaa ccgtccaatg tctgggatac aactcacgat cttctgggct cctccaaggg 84480 

cttgggtcac ttctccagcc aggccctttg tagcacacgc* gtcatcctct aggctccaga 84540 

tacctgtact ccactgctgc tgctgctctt ggtggtcatc tcatggtact ggcatctcca 84600 

aaacgctgca tgaccccttc agtcctgggc cttcaagaga' gaagactaga gcctggcaaa 84660 

gtggcacatg ctgataatgc tagcacttgg gaatgacaag cagaaggatc agaagttcaa 84720 

ggccagcctg ggctacaaga gactctgttt caacaaacaa acaaacaaac aaaccaaaga 84780 

agagaaagaa aaaactggac atgacagccg gaacattatc tgacattcat aaggtcctga 84840 

gttcaatgcc aagttggcag tgcctagttt gataagggtc tagccactct ggtaatacca 84900 

tggctgactg aacaccttac ccagcaactt gctgatagac tctgccttcc agcaaaaggg 84960 

aggagcttcg ctgaggagag aacattgaac cctattgtat atgaataaat tgctgtgcaa 85020 

atgatttcat cagtctcttg tgaatgtgat tgctttgagt catttttctt ggctccagtg 85080 

ttatcctggt ctgcagtgtg gtgtggagtt gtggaagctt tgagttggga gggtttcctg 85140 

ttaaggtttc tctggctctt ttctttcctc ccggtttttg ttttgtttgc ctggtggggt 85200 

tctctggtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt tagaagttgg 85260 

cggggggtgg agggggctgg agagatggct cagcggttaa gagcgccaac tgctcttcca 85320 

aaggtcctga gttcaaatcc caacaaccac atggtggctc acaaccatcc gtaacaaaaa 85380 

aatctgatgc cctcttctgg agtgtctgaa aacagctaca gtgtgcttac atataataaa 85440 

taaataaata ttaaaaaaaa aagaagttgg catggatgat gtagtgaaga ctggcattag 85500 

atatctctgg atccccctgc ctctacctct tagacactgt gagtatggaa gtgtaccacc 85560 

gcaccaggcc aggctagaac attctctgat ctacaaatac ctagagtatt attcctctat 85620 

gatcagaaaa cagacccagg gggccacaga aatgtcttag taggtaaaaa cacttgcttt 85680 

caggcctgat aacctgcggt tttttgtttg ttctgggggg cgggagaggc tggctggctg 85740 

gctggcctgg aattcacaga gatccacctg cctctgcctc ctgagtgtca ggtaccagga 85800 

tcacaggtgt gtgccaccac acttggccta actgcctgag tttgagcatc agtactcaca 85860 

tggtactgag gatagaatag actctcacca gctcttctga cttccacatg tgccctgcag 85920 

catgggctct ccttccccaa aggaaaaata aatgtaagaa ttaaaaaaaa aaaaaaaaag 85980 

caaacccagg tcttgtgtga tggctcagca tcaaagctac ctcccgccac agctgaccac 86040 

ctggtgataa cttatagcct tgttatgctc tcctttgacc tccacgggca tgctgtacac 86100 

gtatgtgtgc ccacacaaac acacaatcaa gaaataaatg cagccaggcg aggtggcaca 86160 

cccctttaat cccagcactt gggaggcaga ggcaggtgga attctgagtt cgaggccaac 86220 

ctggtctaca aagtgagttc caggacagcc agagctacac agagaaaccc tgtttcgaaa 86280 

taaccaaaaa aaaaccactt taaatattat ttttattttg ttttgtttat cctggaactt 86340 

ggtctgcaga ccaggctggc cttgaactca cagagatcca actgcttctg cttcccaagc 86400 

acattaaagg atgtcccacc actgcctggc taaagattta ttttttcttt ctttttgttt 86460 

tgttttgttt tgttttttct aaaaaatttt tttaaaaaga accatccctc ctagcactca 86520 

ggagactctg aagtcagggc cagccaggtc tactgagtga gctctagggc agccagggct 86580 

ccacaaagaa accctctctc aacaaacaaa caaaagagaa cagacccaac cagacctgag 86640 

gacacacact tgtaatctaa gcccttgaga ggctgagaag ttcaaggcta gccacaagtg 86700 

tgtggtgcat tcaagagcag cctgggtggg ctacagaaaa agaaagaggg agagagagaa 86760 
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tggttaatga agatgactct ggaaaagtga aactcaagag aaagcccctc agatttgctt 86820 

aagacgagtt gagggtggag aaccgccaaa gcggacgagc cagacagaga ctgccaacaa 86880 

agttcaatcg gttcaggtac attacttcca aaacgccatt gccacatcag gatgcttcaa 86940 

tcagccaaac caacgcagcg actattgact tctgcatttc agagacttcc gtctctgtcc 87000 

agggcaatgt cactttagct ttcctttgca gaaaggaaaa gtccctgcct ctgatgtggt 87060 

agatcctcac acaccttctg ccagatccag acactggtat gactcagcct cggggagctc 87120 

tatctacaga gataagggta caaggcgtgt gtgtttaaag tatgtgttta aaagtacaaa 87180 

gtgagagtcc ctggaaaggg ctccctgccc tcaccatcac cgaaagcaca aaccttaggg 87240 

taatatctga cattcctgga aatgtatgta tgtattcatt atgtagccct gactgtcctg 87300 

gaatggggta taaaccagga tggcttcaca tctcagagac ccatttgcct ctgcctccca 87360 

agaactaaga ttagaggcat gcactaccat acttggctca tgatttactt aactttattt 87420 

tatgttcacg aatgttagcc tgcatgtatg tgtgtgcacc atgtgcatgc ctggtgcccc 87480 

agaggccaga agaaggtgtt ggttggattt cctggagatg aagtcccaaa caactgtaag 87540 

cagtccaatg tgtgtgctgg agatgaaact tggttcatcc acaagagcag tatgtgctct 87600 

taactgtgga ggcatatctc cagcctcaga tttcccagtt aatgtttgct ttcgcaccca 87660 

ggcccatctg cgcatgcgct ggagacctcc tttaccgcct tgagcctcat tggccaattg 87720 

tggctgggag acttgcagat cccaagtggt acaagagaag aataaactgg tgtgctatga 87780 

actcacctct tctctgtagc cattggctga gcatactttg cctcaaccta ccgcccttcc 87840 

ttcccctaat cctaaatctt tgccctctcc aaatgtgctc ctcccccgca gtaatccagt 87900 

ggtcgctggg gctctagaga gatggggggg gggggagcaa cgggtacagc ttaaggcagc 87960 

tgcagcagaa cttttttgct gtatattgag tcttaaaaat tcatataaac tttgtgttct 88020 

gtttctaaat ataaccccat ctgtttcaac acaaaatgca acaacaaaat gtttcaaatt 88080 

gctatttgga ataattaaaa aatttcaata cttgatttaa aaatgcttta actttttaaa 88140 

taaattttaa atgttattat ttttaaaaag ttacaagttt aaaaaaaaga aagatagaaa 88200 

tcacataatg aaattaacca tacgcaagtg aggctcggtg cactggtaca cagttacagt 88260 

agccatttgg agtggaggcc atggcgcttc acattgaatt ttatactttc tttatagaat 88320 

attttttatg cacctatcta ctactgataa caaaacaccc atgagagagt tagaattaga 88380 

catcaattag ctttgatcct ctgtcataac tcgtgtccac tccctgcctt agtcctacct 88440 

catccctgtc ctcttttcta catcttatac tgaatccaca" cactcagttg tttacacaaa 88500 

cacatacatc actgtccann nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88560 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag 88620 

tgcctggctc tgttttgctg ttgttgtttt tgtttgtgtg tgtgtttaga tttggtttgg 88680 

tttggtttga ttttgttttt ttttagagaa tcttactatg tagctcaggc tgtccttgaa 88740 

ctcacagaga tctcttgtct ctgccttcca agtgctgaga ttaaaggtat acaccacctt 88800 

acctggcccc tttcatctat ctatctatct atctatctat ctatctatct atctatcatc 88860 

tatctatcta tctaaaattt atctgtgtgt gtctgtgtgt acattcccca gagcctgtgt 88920 

ggtagtcaat aagtaaccct cagaagttgg ctttctctaa tccttggatc aaacttgaat 88980 

tgttaggctt ggtagcaagc atgtttaccc actgagccat ttatgacccc atggcccagc 89040 

atcttccatg ggttctgggg acacaaatgt gtactttgat gtttacagga caagcgctta 89100 

accaaccaag tcattttccc agccccatcc tgactcccat taagtgttct ttcccccaac 89160 

ccaggaccaa atctagagga gtgtccatgc ccaacaaaca ctctgccaag cctctcccct 89220 

tactgctctt ctcccttccc ttccttcatt tcttcgttcc ttcttttctt tctttttgaa 89280 

acaggtcttt tctctgcatc ccaagctagc cttgaacttg tgatgtagct caggctggct 89340 

ttgaactcac agctgtcctc ctacttcagc ttcccaaaca ctgggattat agacctatgc 89400 

taccacacct ggctcatttt tcaaataaat aaaaagaaaa tcaaaaagtt cctagaacag 89460 

tcacaggatt cacaaaaact ttggaaggag actaaaaatg gatttttaaa aaatgcttga 89520 

agcacaaaga gttgttgaaa gaagagagaa gaggaaaagt tagcttagta ggtagaagtc 89580 

aatcaagcct cacaccctga gttcaattcc tgaccctatg gtagaaggag aagagcaatg 89640 

ccggaaacat tatcctctga cctccagacc cgctgtggca cgtgcatgca cacacacaag 89700 

caggagccct tggagggaag tcctagaaat gaatcttact gaagcaggtc tgccaggccc 89760 

tgtgctcagc cattttattt ttcctttgtg tacccgacac gcttccattc tcaaagttgt 89820 

gagtctgaga ggaagtactc actgtgtccc cagtgagctt ctgtcttacc ctgggtcact 89880 

tagatggggt cacttagtgg tagccttggt gtggagaaag agaacacagg tccgagtagc 89940 

cagtagacct gagtctttat atctgcaaag ggtgttgggg cataatcaaa tctccccccc 90000 

tccccggggt cctgatacca ggttgtatta agtgtatgtg catggtcctt ccaagtcttg 90060 

acacatcatc caactccaag tggctttctc atttttcctt gccagtagcc tcttggtgag 90120 

gaaatggctg aggaaaacag agttgcagaa agacagggcc atggcctggc tgcaggcttt 90180 

ctctgagtct gaagagggtc agcgactctg agaaatgaag ctatttctga gtgagagggg 90240 

ccaaagaagg aacacggcag agggagagcc cccgaggaga tggagacaga agccggagag 90300 

ggaccctgtg cgaggctgga ggggaggaag aggggggagg agtgagaccc actgtcatct 90360 

gttgggcaga gaggggctac attcatctgc agtatggtgt agaggggaca gagagtgatg 90420 

gtaacaggaa aaatttgggg ttgagggggg cagcctgtag ggctgggccc cagcagtgta 90480 

cagctaggta gagtacacag taactcccag aattctctgg ctccactaaa tccctgttcc 90540 

gctccgtgca gagtaaaacc cacacagggt ggatttcagt ctcctttgca cccccctcca 90600 
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ccccccctcc acccccagct ctggtcacag ccagtcagag ttgggggtgg ggcagatctt 90660 

gtaaaagagg ctggtgagga catcggaaag tctgtaccct ccactagcaa agtgccagac 90720 

gctccgtgac actttaaatg cctcagataa aacagtgaga gactctcctg gtggcaggca 90780 

agatgatggg tcagggacct cagcgcctct gaggctcaga caccaggata aagaataaaa 90840 

acaccacgga gacccctgtg accccctcgt gcagagggag aatgccaatg tggcccagct 90900 

agctgctgga gcgcaagcct caaggcctgt gctagttatg agtctactgc tgctgccttg 90960 

tcccaataaa cccctttccg cccaggatta gtggacacgc cttgctcaag ccgagtccct 91020 

gatctgccac cttatcacac acatacaaaa atcccttgag gatggttacc atcctgggac 91080 

aaagcctcat tctctgctta acccaagtga cacctatatg gcagatccct gtgtccttct 91140 

cctgatgata acaacccttg caatccaata gaggggaact cgggtttcfcg tcagcttcct 91200 

ttatgctgat agaaatgtac tctgcatgtg gggagcctgc cttgctcacc ctgagacccc 91260 

atggggctgg ctggggcttt gcacatcatt gggactcaga gatgttgact acatgaacgt 91320 

cccacacttg gttgcacaag gcagaatgac aggatgttat gcctggtgtg tgagtgtgtg 91380 

tgtgtctctg tgtgtgtaaa acgcctctct ctggagccct cctgtctgtc tgcctcttgt 91440 

tcaatggctg cacaattgtc ctttctcttt ccaaggacct ctgtatgggt gtgtccttca 91500 

ttcagtgcct ttcctctgtg ggtttgtcct gctagccccc tgtcactgag aaagtcttct 91560 

gtctgtcctt gggttgtctg gctagaacac agacatcatt gtcttttttt tttttttttt 91620 

ttttttttaa agatttattt atatgtaagt acactgtagc tgtcttcaga cactccagaa 91680 

gagggagtca gatctcgtta ggatggttgt gagccaccat gtggttgctg ggatttgaac 91740 

tccagacctt cggaagagca gtcgggtgct cttactcact gagccatctc accagccccg 91800 

acatcattgt cttgcccacg actgctctcc agaatggtgg gcaggaggat gtgacccccc 91860 

acccccaggc accgggacac aacatcttct acacgtgtag gtcttgtgca ctggctttgc 91920 

tttcttcttc caagcaggtc tcccaggaaa tggcacttac agagattgaa gagtttaata 91980 

catgtctcgc tgcctctctt ttcgggaacc ccccagaggg agcagcagaa accagggctg 92040 

gcaggggctc taagctgcct gggcaaagga gcagggggta gcatggagcc ttagccaatt 92100 

tggaaagcac tgtgacccaa gcacattttg cagcagtaat gtcaaattct gccgttcagg 92160 

catgccattg atgtgcacgc tgccacacag aaaccagtga- cacaaaggca cagccttctc 92220 

caccctcctg gtgcttagga actaacggct ctaatgagaa atgagagctg aaaggagaga 92280 

gacgggggcg ggccacagca gcgcaggctg gcactgcgtg ttggaggagg ctgacccact 92340 

tctcgtagag gtaaggggcc cactgaaatg tcacttaaat tagccaccac tcccaacact 92400 

agatctcctt tgtccccata cctcagcccc acgcttcttt ctttttttct tctttttctt 92460 

ctcctctggg gcagcctcaa gcccagcacc cactttttag agctgtaaac caccctggtc 92520 

ctagaagccc tcttacgtta ggggatgaca ggaggtagag atcaggaagg agggagggag 92580 

gggaggagga aaggaaaagg gaggggagag agggaaagag atcgagagag catgcattca 92640 

tcacaaagag ccctcttttc tggctttttg actgcactgt gagttattta gccaacaata 92700 

gatgtttatg tattttttta gaacccgtat ttattaacag cctgaaagga gagagacgga 92760 

gatttatata ggaagtgcag tgagttaagg ggggcaatta agagagcaga aagagatacg 92820 

gaacacagac ttgtaaaggg ttttgtaaca tccaatcaaa ggtgcttcag gtattttcca 92880 

aggaagcaga aggtaaaaaa aaaaaaaaat tgtcccatta gaagctgaca ctggatggag 9294 0 

caatggccca ggcggaactc ctgcttgaaa gaaggtgaga agggagggac acagaccagg 93000 

atccgatgag ccagagtgtg gccatagctg ggtcatgagg cccagggttg gaaggacccc 93060 

actaaagtgt gcactggcct ttccttgaca aaggatgcac ctatagctag gcgtggtggc 93120 

aagtggttgt tattctagta cttaggaggc tgaggcagga ggatcaccat gagtgtatgc 93180 

ccagcctgga ctgcatagca acacccagtt tcaaaataac aacaaaagga agtgggggtg 93240 

gggagggcaa catttggaat gtaaataaat aaaacatttt ttttaaaaaa agaaaggggc 93300 

tagtgagtta gttcagcggt taagagcgct gactgctctt ccgaaggttc tgagttcaaa 93360 

tcccaacaac cacatggtgg ctcacaacca tccataagga gatctacgcc ctcttctggt 93420 

gtgtttaaag tcagctacaa tgtacttaca tataataata aataaattct ggagtgaggg 93480 

ggccagagca agtagaggtc ctgagtttaa ttcccagcaa ccacatgatg gctcacaacc 93540 

atctgtacaa ttacagtgca ctcatataca taaaataaat aaataaatct ttaaaaaaag 93600 

aagaaagagg gtggggcagg ggagggaaga agaagaaagg taagaagcta aataaaaggc 93660 

acagagatga gcttcatgtg gaaacacagg cctgtagtcc tggcactcag gtggggttgg 93720 

ggggggctac agtgagagta tcatgagttc aaggtcaact tgggtgagac cttgtctcaa 93780 

aaaatacata ngcnaaaaaa aaaaaaaaaa acatagccag gcatgatggt atacatttat 93840 

agtcccagca cttagaggac tgaggcaggg cagaaagaaa aggaattcaa gatcaggctg 93900 

agctgtatgc agtcctgatc ctatcccctc cccccccccc ccagagacag acagacagac 93960 

agacagacag agagaaacac aaagaaaggg gccttcagat ggctcagcaa ttaaaggcgc 94020 

ttgctattca gaccccatga cctgagctca aagcctggga cccaaggtag aaggcaagag 94080 

ccaactccac agagctgttc tatgatctct atatgaatgc tggggcatgt gcctacacta 94140 

tgttgtgcac acatgcacag attagaaaaa gaggaggaag aaaaacataa gattgtttca 94200 

agaaaagaaa ggctggcttc ttccacgtca gtgtgagagg agggtctggc ccctttgtag 94260 

ccaggtcctt cccagtccag tgggggctga actgaggcag cggaggaggc aataacggag 94320 

ctttcccaac gcagtgtcca gcaaactcaa ctctacagcc tgtcctgatc cacagagaag 94380 

ccttcctggc tccctcacca atgcgggggc attggctccc aggctcctgg gccccccccc 94440 
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acacctgtgg agtgctaggt gatttgctaa 
ttggctcttt ggtaatagac atgcctagca 
gggttgcccg tggctccctg cagctggggt 
cgcaaatggg agagcatgga acagaggctg 
agcttggaag cagcaggtgt ctgggaacgg 
actccagttt tttggcacaa ggaacaagct 
gaaacccgca gttgaggaac agcgggtaag 
aggctggggt gtgaagaaga gcccatgcag 
agaggcagag tccctgagtg gggctccctg 
accccagcaa acaaaatccc atccataatt 
tactgtgtgc cactaaccat cagcctaacc 
tttcttctga gactgtgata gtggcgggga 
gactctccaa ccccagctga gccccttgtt 
cagttctaca ctggttgaga acactcacgg 
tcattttcct gtccccaaag cctagacaat 
tgccaggggc caccccatgt ctccactagg 
aagactgcct ggtgggaggt gagataagga 
agtgctgagc ggaaaatggc agaatgggct 
ctagcatgtg caaggaaggg cctgggttcc 
ggttcatgct ggtattctca acattcagga 
ccatcctcag ctacatagct agcttgagac 
acaaacaaca acaaagcagc agaaggccaa 
ccatctcaaa aagcaagtag catgtatctt 
ctatcaagtc atgtatatat gcatgtatgt 
gatggctcag cagttgagag cacttggatg 
gcacccacat ggcagctcac aactgtctgt 
acacagatat ccacgcacat aaacaccaat 
aaattggaag taaaactctc taaggagaca 
gaagggggag ggcataggtg tgtggtgtgt 
tttggggttc attatgcaat gtatacatgt 
gtgggggcca aaggtctcct cctcaatccc 
tattactatt agtgtgtggt gtgatgtgtg 
ggatgtctgt ggtctgagga tgctctctta 
tctggtagtc aggtttgcaa agctagtgtc 
aaccttattt tttgcattga acatggaact 
gaccctcctg ttcctgcctc ccccntgtcn 
tntccggatt gctgcncacc tccccgccnc 
ccgctggncc ctcccccccc cccccccccc 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
agctctaact gctgttaaat tcacactcct 
caccagcaag gcccaggtcc agcttgactt 
tggaaacgga cggaaagggg catggaagca 
aggactctgc tccttccccc catgcctgcc 
tgctttctga ccgcttagcg gaagacaagg 
gcaatgaaac actaatgcag tgggcttttc 
ttatttcctt tatgtagaaa gtgcatcttt 
aaaagaaatt tctcttatga actaatagca 
tctctctctc tctctctctc tgtctcctgt 
atgaatgcac acaaagtagc ccccccccat 
gcaaatggtg gtgactggga agtctggcca 
aatccctttg cattcaaacc cgggctgctt 
tcctcctcct tacctgtctt cccatccttt 
tacccttaag gacaattata ctcttccatc 
tttgatgaag tcgatattaa agagatgctt 
agaagggttt gggttcgatg tcaaagtttt 
gcttttggct agggtagaaa agagatggag 
ggatgcccag agaaggtgga aggatggggg 
taaaagcagg gatgagctgg gcgctaggag 
tgaccacagc tcctgccctc ccctccccca 
taatcgcttc agaaagccag attggaatgt 
ttattccagg caaggactga cccaaaccga 
gtgccggcag ggcgggacct agctgtgcag 
ctggggggaa aggctaccgt tttatctctt 



tgttgggcaa catttgccca cgtggggttc 94500 

ggagggcgga gcttggaggg gggagtcctg 94560 

gtctggccag ctgaagaagg agccatggca 94620 

tggatgctaa gcaatatggg aggcagtcta 94680 

gcctgtggcc caggcagatt tccagtgagc 94740 

ggctgagccc aagaggcaag tggtgataat 94800 

ggtgccatgg gagcccatgt gctcatgaag 94860 

ggaagccaca catcccctcg agttccaggc 94920 

ggtctcccct tacctaacca gtctcccggc 94980 

tgaggtttat agagacctca aaggctgagc 95040 

ctcccccact gtcttctcta gctgcccctc 95100 

cgggttggga gtgtgtgtga agccctctcc 95160 

ctgcagctca gtaacacagt aacacaggct 95220 

ctctctcagc tccttagaga gcctgttttc 95280 

ggctggtcca tttgtaagct tatctgagga 95340 

ctggcaatgt tctctgtcac tgtagtacag 95400 

aagggatggt ctcccctggg gttcccacac 95460 

gggaggtaac tctgttgcta gagtacttgc 95520 

atccccagca ctacagaacc caggcgtggt 95580 

ggtacagtca ggaagagcag aagttcaagg 95640 

cagcctgggc tatgtgagac tttgtctcca 95700 

ctggcaagag gagtattacg taaagtaaat 95760 

ctttcatttt tttttacatt ctataaaggc 95820 

ttgtatgata tgaaagtagg gggctggata 95880 

ctctttcaaa gaacctgggt tcaattccta 95940 

aattccagtc tcaggggatc tggcaccctc 96000 

gcacataaaa- taaataattt ttaaaaaaag 96060 

aaagggactg aggggaagtg ggaggggcat 96120 

ttaacatgca' gaatacactt ctataaaagc 96180 

gtgggtgcaa gatgtaagct gtgcatatgt 96240 

tctctgcctt attttcattt aaattataat 96300 

tgggtgtgtt aagccctcac ggcaatcaga 96360 

ccatgttcgt gtgggttctg tggatggaac 96420 

tttatctgcc gagccacctt gctggccttc 96480 

tcctgagttg cctggacagc aagtccccaa 96540 

nnntcacang aggacacacn gcttantggg 96600 

ccnagcctcc tgcctccccg cccctcgccc 96660 

cccccccccc cttccccccc ccccnnnnnn 96720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 96780 

nnnncagaca cacacactca aattaaatat 96840 

tcacatcccc acgctaggac tctaaggagg 96900 

agagcaaagc atcctccccc ctccacacaa 96960 

gaaccagaca acagcagcct agccaagccc 97020 

gtgcaactgg ggaggcaaag ccccagccgg 97080 

ggagcctgtg attatgattt ctgctgattt 97140 

attaagccag atttattcaa tctaaagatt 97200 

atatgttgtt ggaggagcag agatgtgata 97260 

ctgatacata gtggtagcta tgcctaggcc 97320 

gcatgtgtgt gtgtgtgtgt gtgtgtgtgt 97380 

attatttctt ctgtgggatc tccagactca 97440 

tgcaattctt gccttttctc ttgccagccc 97500 

gctgtggcca gccctttcac ctggagtcct 97560 

gcagacaatt atcctcaata actagccaat 97620 

agcaaacacg ggtgttcttt ccttgagtct 97680 

tatttacata aagtcaaata gctccctttt 97740 

aaaatcttaa ctagaggatg ggtgtagagg 97800 

atacttattc tgatgttgct ttaaaaggta 97860 

agggagggtc cctcctcaag ctaatgaatc 97920 

tggaaccagt cagaagtgtc tgcctttgac 97980 

gtctctctgt gaaccgccag cattaggagc 98040 

gttgctcacc ctccactgct cagaaaacct 98100 

tcatggcatc tgccaatcag gaggccaaag 98160 

aaacagctcc gttatggcgc gcagaaaaag 98220 

ggcagatggc ttctctcttt gatgctttgg 98280 
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gccttacctg 
gttctcgaag 
cccccgctgt 
tcggaaattt 
gtagagtggg 
gaaagaggtg 
ctggcttccc 
tattaggacc 
cctggcttct 
ccctaaaggt 
ggacgctcct 
tactttcttc 
ctgagtgaga 
aacaggaaag 
ttacctttcc 
gcttcaggct 
aacaagggga 
agccagacat 
atctgccatg 
atgctggcct 
gctccttgga 
tcttagctgc 
agaccctcca 
tccacaattg 
gcaaacccca 
tgagccctgc 
gatccatgct 
ctcatcctgg 
gattaaaatc 
acaaagccct 
ttctaagtgg 
cccccccccc 
ataattctcc 
gaagcacgct 
ctctcaagtc 
atttacactt 
ctgaggactc 
ccctggcctc 
ggtacaccgg 
aaaggcttcc 
aaggtaaatg 
ccttgttacc 
ctgctgagct 
gtctccccac 
ttcatttgca 
tataagaaaa 
ggccaaacct 
ggcccggcac 
gatgaatgaa 
tgtgtgaggg 
tcctggggcg 
tgtctgtaac 
acgtggtcca 
gcctctgaag 
aaaacaccca 
caaagtgttt 
ctcagagtga 
agattcacgg 
cctctctctc 
ccatgtagga 
tgaacgagtg 
aaggcaagag 
ggttctatct 
cccaaccaga 



ttactgcctg 
ttagaaggaa 
gtttctaaag 
ctttttaatc 
agttccgtgg 
ttgtgggcct 
actctcaaga 
agagggactg 
ccaggagccc 
atggggatga 
gtaccccaga 
tgttgttgtt 
aaaaagagga 
ggcaggcaac 
tgacccccag 
aagtaaacag 
ggaggcagtg 
ttcaggcacc 
cctgccatct 
ggaggaaatt 
aacccacgtg 
ctcaagcact 
gggagtcatt 
aggagacgcc 
gccactgagg 
ctcccagggg 
tttatgacat 
ggaccaggga 
attttttttt 
cctgtgctga 
agtgtgatgt 
aaacttactg 
ccatctccct 
gaagaggcca 
tacacggagc 
catgccgcag 
catcaagcac 
cttacagatc 
gggctgcagc 
taggaaatca 
gatgttattt 
atgcagggct 
gagggagggt 
ccccgccccc 
gcagcttaca 
agaggaatga 
ggccgagaga 
aacagggccc 
tggatcatct 
agatgtcaac 
tcagcttcag 
ccgcagaggg 
gacccctctt 
gtatgtcacc 
aacgaaagaa 
ttcagcatga 
gatgcggagg 
ctggcagaaa 
ttttcaaggt 
cctggggcct 
ataggaaccc 
caggaggcat 
gggaggcagg 
gagaattcaa 



cacttgactt gacctaggca aaaatagcag 



aaaaaaaaag 
agctgttttc 
atagcaggag 
ccacagagag 
gtgcacacat 
tgaggtgtgt 
tgtgtgtgga 
acatgccaac 
gggaccaagt 
ctcagccgcc 
tttggaagtg 
gagggaggaa 
aagacaatga 
ggcttagcca 
ggaagagttg 
gccaggcagc 
tctccttccc 
gagagaggcc 
ggtctttagg 
acagagctcg 
gtaaggttta 
gcctaccctt 
agacctggca 
aacttgcata 
acaactgggc 
agaaaggcct 
aggcgagcat 
tctgctccat 
aattagaccc 
agtggcagga 
tctcttaacc 
cctctggtgg 
gactcgggat 
tgatttattt 
cttcctgcgt 
gctgccttgc 
atgacctcct 
ctctctatgg 
gacgctgcta 
aaatgttgcg 
gtggacgggt 
ggggacacac 
gccctcactc 
gggcttgttg 
aagagaaagc 
atccatgacg 
gccacaagag 
gtccttagag 
agaggttccc 
tcggaacagc 
aagggcgggg 
gggttgatct 
gctggctcag 
aaactttgtt 
caactcactg 
gaagtggatg 
atggctgaga 
ggtttttgga 
gtgcagatgg 
aggcactaaa 
ttgagggtaa 
agtcccaatg 
gatggcagag 



ccccaaacca 
ttcccaagct 
tcccaattag 
cagaggcaat 
gtgtgtcaac 
gcaccccagg 
ggggtgttgc 
aaacaggctg 
gctttgcaag 
acccagggcc 
ctgatgtcaa 
aagggggggg 
ccacaaggtc 
atatagctga 
gacatgggtc 
catgcccacc 
tgggtgccta 
actgggactt 
gacactgaag 
catgacaact 
ggagagcccc 
ctgaactcta 
ggggagcaag 
caagaaactg 
aacagatcct 
cagtctcagg- 
cttctgctcc 
gaactcatac 
cgaaaaatag 
gcgcaggatg 
tctcgagtcc 
gctggtggaa 
gccatgtaag 
acctcccgtg 
ggcacggcag 
caggatgaca 
gccccgtgag 
gggaggctga 
gtaattaagg 
tcatttaaag 
ggcaattagg 
cttccggtaa 
cccacccccc 
cccttaccca 
cagaaaagtt 
gtccgcgcgc 
cttcttgagt 
cacaagccag 
taaagaatga 
cggaccgtta 
tcacccagca 
gctcctgaaa 
ccagcttatc 
tcaagagcag 
cccgtgacaa 
acaacagctg 
gccttgggct 
aatgtccttc 
ccctgtgttc 
cacacaatgt 
aggagtgtaa 
ccagcaaata 
ggacaggctg 



caacacaacc 
ctacagcgtg 
cgtgttgggt 
attcagcata 
gcacatgtgg 
ccttttgatt 
tcacagtgca 
cacactcttg 
acagcaggca 
agctctgatc 
tgcagaattc 
ggacgggacg 
cctgtaacta 
gacccagtct 
tccattctct 
gatgccatcc 
gaggtgctgt 
ggtagagagg 
gcagtttcct 
tgccggctct 
agatgtaagt 
acatggtcca 
cctttgtttc 
cctctgggcc 
tccaggtggc 
atttcacaca 
ccccaaacaa 
agttttcaga 
gttagctgac 
ggctgccagg 
ctgggtttct 
agcgtaatga 
tacacagcat 
aaagagacaa 
caccccctcc 
gcagcccact 
gtctgtcacc 
ggaaatgaat 
agcgaagcat 
agtgtcctgg 
ctggcacggg 
ttgctgctgg 
acttctttcc 
gaaaatcacg 
catattaggt 
atataaccct 
tgccacctgc 
gcctgattct 
tgcttctatt 
ccttagctct 
ttgccactcc 
aacagtgttg 
tccccggtgc 
agtgtggtgc 
ccagtacttg 
tatccaggtc 
gcatccctcc 
ctgtgggttg 
ctggtgctgc 
ggttgtatct 
ggactgtgta 
caatggactc 
tctgagtttc 



cgagatacag 
cggaagtgtg 
gtggctctaa 
aatctttcaa 
aagccctaga 
cttgtggagg 
ctcaaagctt 
gaaacccaaa 
ctagtacatc 
cagagttctg 
tggcttgacc 
agcagagtgg 
ggccgaggcc 
cactaactgc 
tggtgctgtg 
ctcctcatcc 
ttctgggagg 
gtctgcatcc 
ttctccacac 
ctgttcagtg 
caactcccat 
atggatggga 
gcttttccat 
tgacccattt 
tctcctggac 
tgcagtgaca 
ctgtatttcc 
gtgtgggaat 
taccgaggag 
aattacttgt 
gctgcagtct 
tgccgggatg 
atcaacgctt 
cgccagccac 
caatcatcat 
ctctccgctg 
ctcagcctct 
cgaaaaccac 
tccgtaggta 
aggtgcgtga 
tgcttcagtt 
tagagctcac 
gcagctctgg 
tgacagctct 
ttggaagaaa 
tcggatctgc 
gtggcaacag 
caggagacag 
ccaatattga 
tccatgctaa 
gctgttctcc 
ccccaccctc 
gctcaagttt 
tttcaagatc 
caactctgat 
gctgttgtgg 
caaacagagt 
tcccctcctg 
tgtgcctttt 
tgttgagatg 
gactagaagc 
aagagatgag 
tcctggtcga 
ctatggctgc 



98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
98880 
98940 
99000 
99060 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
100740 
100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 



95 



WO 02/29059 



PCT/US01/31488 



accgataaat 

gactagaaat 

tcacactgtg 

gggacacatt 

gagaaatggc 

ctagcagcca 

ctaatacata 

cacacacaca 

ggctcagcaa 

catggtggct 

acagatacaa 

gaaagaaaga 

aaagaaagaa 

acttccaggg 

gaagctgtag 

gtagacaacc 

caacgcctca 

taggacactc 

agacctatct 

catccaactc 

atctgctctg 

gaggactttg 

ctctaccctg 

ctgtgcccac 

agactgagaa 

aaatgtctta 

tttgtgtgtt 

taactaagga 

caggtgtgtg 

tgcatattag 

tttgagacaa 

accagaagaa 

atttgaactc 

agtccttgag 

gcttcagcct 

cttccttccc 

cttcccttcc 

cttcctttcc 

taatcctggc 

nnnnnnnnnn 

nnnnnnnnnn 

cgacaacacc 

ccaattttta 

atagctcctt 

gtcctagaac 

atgatagact 

aaagtcatcc 

ctcaaaaaga 

gggaggcaga 

ccaggacagc 

agtgaacccc 

tgtttaccca 

tgtttgaatg 

ggcagtattg 

gagtttaaag 

tggcctgctc 

ctgtttcctc 

tatactacac 

tctagagatg 

cctccatctg 

atgagacctt 

tggtggtggc 

agaaaggctt 

aataatgatg 



ggtcataagc 

ctagctcaag 

acatgatgct 

catggaccgc 

ctggtagtta 

tgttgggcag 

catacataca 

cacacacaca 

ttaagagcac 

cacaaccatc 

tgtactcata 

aagaaagtgt 

agaaagaaag 

ctataggcaa 

acaacctgtg 

tgtggcctgg 

ccctctgtgc 

tacatcaaga 

ttacatgtga 

actatgtgac 

agagtcccct 

gatctgcaaa 

aaaccctttt 

aagaagtccc 

cattgtgtct 

ctgcctttat 

tgtttgtttt 

tgactttaaa 

ccaccgtacc 

gtaagcatgc 

tgtctttttt 

ggcattggat 

aggacctctg 

acaatgtctt 

cctaaatact 

tttcccttcc 

cttcccttcc 

ctctctctct 

tgtcttggaa 

nnnnnnnnnn 

nnnnnnnnnn 

ggcgccgctg 

aaaccagtgt 

actttgtttt 

tcaatgtgta 

gatggtgtaa 

ttggctatgt 

aataaatgta 

ggcaggcaga 

cagggctaca 

aacagtactg 

ggtgtaccca 

agagaaaagt 

gctggagagg 

cttcctccac 

ttggctccgg 

ccccacccct 

tggtgatctt 

tcctttcatg 

aactttagga 

ggagcctgca 

agtagtagaa 

tgggtgcaca 

acgatgaaaa 



agagtagagg 

gttgtgtgtg 

gtggtctgca 

aggttacaca 

agtatgcttg 

cttacaacta 

tacatacata 

cacacacact 

tgactgctct 

tgtaatggga 

tacattaaat 

aaacgaggaa 

aaggaaagga 

cctgtggcct 

gcctggggaa 

ggaagctgta 

ccaatttcct 

tgatcttgtc 

ggtcactcca 

agagtcatct 

gcctgcccgt 

gccacatctt 

ctctgaggtg 

ggggagaagg 

cacctaaaat 

tccttttcct 

cttctgagat 

cttctgatcc 

ccagctttat 

taccaacttg 

tttttaatta 

cctattagag 

gaagagcagt 

gctatatggc 

gggattacat 

cttttccctt 

cttcccctcc 

ctctccccct 

cttgctctgt 

nnnnnnnnnn 

nnnnnnaaga 

cctccactgc 

ttcaagccgg 

gctttgacgt 

aattaggctg 

aactccactt 

tgtgagtttg 

ttgccgggca 

ttttctgagt 

cagagaaacc 

ccggacagtc 

caaggtgtgt 

ttcccatcag 

tgatggggag 

gaagaccact 

cctcagttta 

ctccctgtct 

gactgtattt 

gctgggacct 

aacttcttgg 

ctttgttaag 

ctctgatgaa 

tggctataac 

taattctctg 



aaaaccacag 

gcagggttgg 

gatgtgttgg 

tgctatttaa 

ctgatcttcc 

actatgactt 

catacataca 

ttaaagaaaa 

tacagaggtg 

tctgatgccc 

aaataaataa 

aattcctaat 

attctgaggg 

ggggaagctg 

gctgtagaca 

gacaacctgt 

gttctctaag 

tcaagatgtt 

caggttctag 

agagatttgt 

gggaactccc 

ccaaaaccat 

gcttttagag 

gacccaaggg 

cggtggtcgc 

ccgctccatc 

gtagcccagg- 

tttcttccct 

ttgagactat 

gctatattcc 

tatgagtaca 

atggttgtga 

cagtgctttt 

acatattggc 

gtgagccatg 

ccctttccct 

cctcccttcc 

ctttcttttc 

agaccaggct 

nnnnnnnnnn 

agaagaagaa 

catccacctg 

tacactgaag 

tttgtgacat 

gccttgaact 

aggaggcaga 

aggaccaacc 

tggtggcaca 

tcgaagccag 

ctgtctccaa 

tggtgtcttt 

gagcagctct 

ctcgggtgtg 

gtgtagcctt 

gggctgatct 

tctaaaattt 

gtattccctg 

cagtttacct 

ggctcagaaa 

cttaaaggtg 

caccctgggt 

cagttagtta 

tcagtggcga 

tgactgttct 



acagaaattc 

ttcctgggtg 

gctgcatcca 

aaactccaag 

agaagacctg 

ctgagctcca 

tacatacata 

aaattctggg 

ctgagttcaa 

tcttctggtg 

gaaagaaaga 

taaaaaagaa 

agaatctgcc 

tagacaacct 

acctgtggcc 

ggcctgtggc 

gacacatgcc 

taacaaaatt 

acatattttt 

gtccaggaca 

cagtggtcct 

tttcctcttt 

aggcaggtct . 

ccagtgctga 

aaggaccaag 

ttactcctca 

ctggccttca 

ccacttccag 

gattcaggct 

cagcctttct 

ctgtatctgt 

gccaccatgt 

aaccgctgag 

ctcaaactca 

gtgtttggct 

tttccctttc 

cttcccttcc 

tttcagagag 

ggcttgnnnn 

nnnnnnnnnn 

gaagaagaag 

agacaggact 

tagtagtccc 

ggtgtgatgt 

tgcttctgcc 

ggtgggagga 

ttggctacat 

cgcctttaat 

cctggtctac 

aaacaaaaaa 

cctaagtctc 

atacccagag 

tgaatactcg 

ccggtggaga 

tgttaccaac 

acatgttacc 

ccctctagtg 

cttgttctct 

tttctaaggc 

tatttctgac 

ggtggtggtg 

ttcaaggccc 

gagacgtgct 

ccccacttcc 



atgccattga 

ttcaaccttt 

tagctaccct 

ggaagggcta 

agctctgttc 

aagctctctt 

cgtacacaca 

ttggagaggt 

ctctcaacca 

tgcgctgaag 

aagaaagaaa 

agaaagaaag 

ccttttccta 

gtggcctggg 

tggggaagct 

agcatcatgt 

atcaaatgca 

acatctgcaa 

gaggagccac 

gactggctgt 

taagggccct 

tggagagcta 

cagcagggca 

actatcgctg 

caggctctat 

tttttgtttg 

gctcactatg 

agtcctgggg 

ccatacttca 

ttctttcttc 

tttcagacac 

ggttgttggg 

ccatctcgcc 

gaatccttcc 

tctagccttt 

ctttccttcc 

cttcccttcc 

tttctctgtg 

nnnnnnnnnn 

nnnnnnnnnn 

acaacaacga 

caaatccaga 

acttgggatt 

agtcttggct 

tcctgctggg 

tcagaaattc 

gatatcctat 

cccagcactt 

agagtgagtt 

caaacaaaaa 

ctttcaactc 

gtgatacggt 

gtccccagtt 

tgggctttgg 

agaattggat 

tgatcaaaaa 

gtgctggctg 

ctctgctgac 

actgagcctt 

ttagtatgca 

gtggtggtgg 

atctagaaaa 

tcccatgaat 

ctctctctca 



102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
104460 
104520 
104580 
104640 
104700 
104760 
104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
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ccctagctct 
cggaacgtta 
tggctagtca 
tcagcaagaa 
gctaagtggt 
ctacactgtt 
aactggaatt 
gggatatttg 
gtagaggcca 
ctctcgatca 
ttggccttga 
agaggcctcg 
gcttgttgaa 
tctgccaagt 
cttggagctg 
taagtggcat 
atcagctgac 
ggattctgac 
tccctgcatc 
ctccccagca 
atctcctcag 
ttgatgaaga 
ccacccatgg 
gtaagaagga 
ctggggcgga 
tttgaagcaa 
ctatctcagg 
gctcttagaa 
ggagggagtg 
gccaagtcat 
cccctaggga 
ggcctgagtc 
ggtttctggt 
tctctaaaag 
ttcagcgact 
ctagacctcc 
ctcgagctag 
gaaaaaaaaa 
gtggctggct 
ttagctcagg 
tgtagccctt 
acagagcgcg 
gcactgcaaa 
gctggtctaa 
ttggttagtt 
ccaatcagag 
tgggtgtggt 
ctcgtgcgtc 
ggaagagaca 
agcaaccccc 
aaccgcggta 
ggggagggga 
ccgagtagtg 
ttggagaagt 
cgccctttgg 
gaggggtgcg 
cctggcaggg 
ttatgtaagg 
aaaggttagg 
ctggaaactc 
gtcacctttg 
ctgactgcct 
cctctctccc 
cgttcccacc 



tatctaccga 
gtcacttccc 
ctgaggaaaa 
agaagctgac 
ttcaagaagt 
ttgtagatga 
tgtttggcta 
gggacacttc 
gggtggtact 
aaacatcaac 
ggtgggtggg 
agtgaggagg 
acttttcttt 
cagtcccggg 
gggcaacaca 
ctgagaagtt 
ctaggaagga 
ccactaggct 
atccgcagga 
gtggcctccc 
aagtctgaat 
aaagatcatg 
cggggttgag 
ggatgtggtt 
aggcaggaag 
gcccagcagt 
cttccctctc 
atgactcaaa 
aagaggtggt 
tgggttcatt 
aggagccgcc 
agagaagcgt 
tgatactact 
tgagggcgaa 
aagacgagac 
agtgaagatt 
ctctccagtg 
aagtcctatt 
tccatcttta 
cctcctgtga 
taaggacttc 
ctgcattctg 
aatacacgcc 
gcaagtcgcc 
ggtaggcgtg 
cccagacgcc 
cacgttgccc 
acgacgccgc 
gagcggtcgg 
gcacccagca 
accgggtgat 
gagggggcgg 
tgtctggacg 
aactggggag 
tttgaggatg 
gaggtgcatt 
aaggggtcgc 
ggggagggga 
gtttcaccgc 
tacccgcacc 
cacctccccc 
gggaagccgg 
cacctcctgg 
tccctctggc 



atccctgcac 
ttcgcctact 
cggcagctgc 
agcaaatgtg 
ttcttcttaa 
agaaaacacc 
attcagtggt 
tggtcgtcat 
gaccttccta 
agtgctaagg 
aggaggttag 
gaacagggct 
cttccaagct 
acttcgcact 
tttacctggg 
aaaagtgggg 
agagagcaac 
tatcttccac 
gcccctggag 
cactaactga 
aagtaaggga 
agaagcatac 
ggggaagggc 
agggctgatt 
tctccttaaa 
caccttagtg 
atttctttgt 
agtcaagaac 
cccagtcttg 
tcggtttttg 
tcttcattgg 
ttcggagggc 
ttggtaaaga 
tcctttgtta 
attcactaga 
gggcctgtgc 
aagactgggg 
tgtggaaaaa 
gttctactca 
atcagttttg 
cccgcgaccg 
ggaagctgag 
tatacttcct 
taggccgagg 
gcttagagaa 
tgaatgggcg 
agcaatgggc 
cagctgatcg 
ccgtgcggac 
tcaggtgggt 
ggggaaagta 
cggggacagg 
tttaaagaga 
gaatatgaga 
cccgagctga 
gcctctaccg 
tgggcggtcc 
aaggagtagg 
gggttgggcg 
ccccctccca 
ctagtatgtc 
ctggctgggt 
ccaactcttg 
cctagtttca 



aagcacccat 
tcagtattgt 
ctgtgggcca 
ccaactgtgg 
ccccctacaa 
attccgaagc 
tctcaaccag 
aattgggatc 
ctatgaacag 
ttgagaaatc 
agagtggaat 
cctatgctag 
tggctagagc 
aagtttgtcc 
tcttcccagg 
tgagtgggta 
tgaagcagca 
agcctttctt 
cacccacatt 
cagtggtgac 
cagatgttgg 
attttacccg 
agaaaaaagg 
cgtcccctgg 
gacacaatat 
gagcccatac 
ctcagatgct- 
cccaggccac 
tccctttaaa 
cccatccccc 
tctccacctt 
gggactgaat 
tcttccccta 
agaacgtgcc 
aaagatttca 
gggtgacatt 
ccgtgcgagt 
aaaaaagact 
ctcctgttgc 
aggctaaaag 
agtcagagat 
tgtcaccgta 
tctgctcttt 
gttagccaca 
gctcctccag 
ggagtaagca 
ggtgattggc 
gagactggag 
aggtcgcagt 
gtgatctggg 
gggtcctgac 
ggcgctcttg 
gagtcccgga 
ggccagaggg 
ccatttagcc 
gcgctgactg 
ggcccctcct 
gggcggcggt 
gaggttgggg 
gcctaactgg 
cggtagagag 
ggggcgcctg 
gcccctcccc 
gtctccaact 



ggggtttaca 
gtttccagaa 
ccagcccatg 
gtctgctgga 
gaaaccacaa 
tcactgccag 
ggtggatttg 
atgtgctact 
ggcagccatg 
tcaggctgaa 
aaaatcagaa 
ggataatgga 
cctgctcaat 
tggagtgacc 
agtgggttaa 
aaggcaggag 
aagagctggt 
gtttaggctt 
cagccggccg 
aggaaatatc 
ggagaggcgt 
ctatggttgg 
agatggagaa 
ggtcgaacaa 
tctgaacgtt 
ttaatttaac 
ctcacttaga 
aagtttctct 
taagcaattc 
gcctttcaga 
tgaaatcact 
gggtgttaat 
atttttaaaa 
ccttgagaag 
ctaaacccac 
tgtccctata 
gacagtggtc 
tcgggtgttc 
ttcgcgtgct 
aagttccaag 
cagtttaaaa 
agaacttcat 
aaactgtagt 
ccttttcagc 
gcaagggggt 
gaggtgctgg 
cctgggtggt 
ccggtgtgtg 
gattttgctc 
gacccggtca 
ggccacaccc 
ggagaggagc 
caggagtcgt 
ccgggggcgt 
tagggaggat 
ggtcagggcc 
ctcgttccct 
gcggaggcct 
ggggcggaca 
ctgtcttgga 
gcccctagcc 
ggttagtcat 
acggcctccg 
catttggcct 



gaatctgggg 
gtacccattt 
ccaagtgagg 
tttctactgt 
attttattat 
tcagcactgg 
gactccccag 
ggcacctagg 
tacataaatt 
accctgtcat 
gggccaccac 
gaatagggca 
ttcccccaac 
ctgactccag 
aagtcaaaga 
gaccccatca 
ccaggagact 
gggctcagtt 
ccaggacagg 
tcccattcca 
cactcttggg 
ggttccatta 
ggacagacac 
tcagctgtta 
gaactcagga 
acagaagcgg 
aagcttagat 
ttgggggtgg 
agcagctttt 
ctctgattgg 
tccctaagta 
cttagaaccg 
agacgcttcc 
ccgtgggctc 
gagggataga 
ccccgaagac 
cctatacccc 
tgctgcatcg 
ccaccttcgc 
aaggaggggc 
atgccaactc 
tgaccggaat 
ttgacgtaaa 
cattggccag 
ggcctccttg 
cgcccccgag 
tcattcgcag 
ctgggcgctg 
ctctgtccac 
tcccgggggg 
tgcccttctg 
ctggactctc 
ggcagaaggt 
ctaaccccga 
ctggacgagc 
agttcaagtc 
cccggggatg 
tatgcaaccc 
ggaggagtgc 
cagagagaag 
cgggcttggc 
cgctgggctc 
gttaggctaa 
gtcaccctgg 
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ctgttagagt 
gggtcagtgc 
tcagcagcca 
gagactgatc 
ggaatttgaa 
aggcaaagtc 
gaaccaccct 
ccatgctgta 
cttggaactt 
cagatgtacc 
cagccaccct 
tcctgagagc 
aaccctcctt 
atttctgaag 
atgccgaaga 
aggggcacaa 
gaaggtactg 
acccagcttt 
ccctcacctg 
aaagctgcca 
gcagcagctt 
tggggtgatg 
gcccagtgaa 
ctccccaaga 
tgtttactga 
cacacacaca 
cacacacttt 
actccgttct 
tcgagacagg 
tggcctcgaa 
tgccaccacc 
tgggtttagg 
ccatagataa 
tcttcttctt 
caccccctga 
gtcccaggca 
ccaaggcttt 
tgccatttat 
gcaagaaatt 
cctgattcat 
tcctaacaga 
ggagttctag 
ctaatccagc 
cgggctgggg 
cgcttgcttt 
atggctctgc 
caacagaggc 
acattggtca 
gcctcagtga 
tgaagggcct 
aggagcatga 
gtaacactgt 
aacatgcacc 
ggtctgcaat 
ctgctataca 
caaaataaaa 
tccctgatca 
tggcagtacc 
gtatcctaaa 
tggaacgcag 
tagggcttgc 
gtcttacaga 
gtatgagttt 
accatccaat 



aggctagaag 
cctgggccca 
gtgaaccctg 
acctaggaga 
agttgcccac 
tccttggccc 
ctcgggctcc 
gggccccggc 
ggaacaaagt 
cggaatgtgc 
acccatacct 
ccctgctgac 
gctgctgaat 
tacatttttt 
tgctgggctc 
agcaagtgca 
aaagtcagct 
cctgtttttt 
gctgagccgc 
gttgaagaac 
tgcctatcat 
gctttatggg 
agatttaggt 
ttagttgatg 
gacttccttg 
cacacacaca 
cctttccatg 
aagtagtcat 
gtttctctgt 
ctcagaaatc 
gcccggctgt 
ggtacttacc 
gtaggcagga 
ctcgtggatg 
ctactcaccc 
agctcgtggg 
ccctccacct 
tttctgcctt 
gtcaggggag 
tctcacgtat 
agatgggttc 
cctgttcccc 
cccactgcag 
aggctgagtc 
ctttcctcct 
tgtcctcatc 
aggcacaaga 
gagaggcacg 
aagacacgtg 
ccaggaagta 
gccacctctg 
gggaaaacat 
aagccctggt 
cccggcactc 
atgagttctg 
taaagcattg 
taatacaaca 
aaccattgtg 
ataaggattt 
ctgtttgttt 
ctaagtaaat 
agaaaaattg 
gtgactgttt 
tgagtggaaa 



ctgtcatggt 
ccccgccccg 
tggcctgaac 
aggaagatcc 
cctagtgtaa 
ttctatctgg 
cagccctcta 
ttttactgct 
tcagacgtgg 
aagcggaatg 
ccctcccctc 
cttccaactc 
tccctaagaa 
tttttaactt 
cttagcaggt 
tttagaagcc 
agagccaggt 
tttttcctcc 
agtagttctt 
tgttgccctc 
ccggaaggtg 
agggaaaccc 
taaaggcact 
tctgtgtggc 
taggcctgcc 
cacacacaca 
aggtccaaaa 
ttgtgtattt 
atagccctgg 
tgcctgcctc 
atttacattt 
cttacacctg 
gggcattaaa 
gaaacgaaac 
agctcagggg 
aggctagtcc 
aaagcaaaac 
tgctttccag 
atgggttgta 
gggcttggtc 
agggggtaca 
atttgtgaga 
tctaagctga 
ggctcacaca 
cccttccctg 
caacatggag 
cagtggagga 
tagaagcctg 
cttccagcca 
tggtcccatc 
aaaggaaata 
atctatgact 
tctgtcttct 
tggaggtggt 
agccagcctg 
gttagtaatt 
agcaaatgaa 
agagatgcct 
ggttataaaa 
ggaagtccat 
tatattcatc 
agtggaaata 
ttaaaatatg 
tataaatact 



gccagagagt 
cagccaaggg 
agagctatcc 
gacaaagttt 
tctttccact 
cagtggccat 
gcctgccacg 
gattcatgcg 
aggggccggc 
cctggcatct 
ctgcctttgg 
tagtgcccct 
caagtcattt 
gggacttggg 
tgccaagagt 
tcttgcttct 
ttggatggcc 
ccttcctttt 
cagtggcaag 
tgcccctggc 
acagaactgg 
tggtcctctg 
gtctataaat 
agtgggaaga 
tttcttatct 
catacacaca 
gtaaatgtac- 
actttttttg 
ctgtcctgga' 
tgcctcccaa 
ctttatttat 
tggatttttc 
agtccaccag 
agctcttcac 
gattaggatg 
tctactggct 
tgtagctctt 
gatagtgaga 
tgatatgagt 
tctgattgtg 
ggaggctgtt 
aactgaaagt 
gggataggat 
ttgcgacaaa 
gccacagtgt 
cctcagaggt 
cctggcctgg 
gagaacacca 
tctcctctca 
tctaccctgc 
cacagcaaat 
ggggttgtag 
gcattgcata 
ggcaaaggag 
ggctatatga 
caaagaaagc 
agccagaaga 
ttggacctgg 
tgttcatctc 
tccttttctg 
tgatggtggt 
taaatattga 
aacacatact 
gaatactaaa 



tgatggagca 
cacctgcttg 
tgggcagaga 
atacttccca 
ctctgaaaat 
gtccttggac 
cccccagccc 
ttggaactgt 
agacagcctg 
ctagtcctga 
tcagctgtcc 
cccatttcta 
gagttgatca 
ttctacaccc 
tgccagctcc 
tattcaagaa 
tctgggtcgc 
aggaacctgt 
ctttatgtcc 
ttcgtggagg 
ggtgggaagg 
gggagccctt 
tggggaatag 
aatagaagga 
tcatcatcac 
catacacaca 
tcaggaaggg 
tttatttgtt 
actcactttg 
gtgctgggat 
ttttagtctg 
cacctgtata 
tggtgactca 
atgaactgtt 
gaaggaaagg 
tctcaccatg 
ggttgggttc 
ctctgctcaa 
cccttctgct 
gttcaccttt 
tgttgtattt 
cataggggag 
gtgtaaggga 
gattgccctt 
gtccctccag 
gagaaagggc 
aaccacaagg 
ggaaagagag 
ggacctgcct 
agtttctata 
tcaaaaagag 
ctcagttggt 
aaactgaaca 
cctacattca 
gactgtctca 
agatgtggct 
aggctcctgt 
tagtttgctg 
agggttgtaa 
ctgtcatgaa 
gttctgtgca 
atactaaaaa 
tgtaatatat 
aagattatga 



gctggtcaga 
gcacaaactc 
gaagtggaca 
agaggctttt 
agaaatccca 
tgactgtgca 
cctccctgag 
gggggcgggg 
gaattcatac 
ggaagctgcc 
tccctcagac 
accctacaca 
cagagctcat 
tgccctttga 
tagtctgtaa 
cccctcatta 
tggccctgtc 
gcctcccaca 
tgacccagct 
aagaggagaa 
tctggacagc 
acccccactg 
gtgactccac 
aaagtctgtc 
catgccaaca 
cacacacaca 
ggacattgaa 
tgattgactt 
tagactaggc 
taaaggcgtg 
gcccagattt 
atggggaatc 
gagcctgggc 
gtccttcccc 
ctatggttaa 
catgggtggt 
tagcaaccac 
tactgtgcag 
gcctctagct 
ggcccagtct 
gacaggagga 
actagatcat 
ctgtagcaga 
ccctcgacct 
cactgggtac 
agcctggaag 
gcctatccgg 
cagccagcca 
tcctgggaga 
aacagcctca 
attcaaatgt 
aggtttgctt 
ggttggccca 
aggtaatcct 
aaaaataaaa 
gaaaccgttt 
gccttgtgtg 
tcttagaaat 
tagagaaaaa 
aatgtatagc 
gccatccaaa 
gattataaaa 
ttttttaaaa 
aaagtatgag 
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tttgtgactg 
aaagtggatt 
attttcccct 
ttaaataaaa 
tgcctgtgag 
gccgtctctt 
ctttgtgcac 
ctggagctgc 
ctagaagagc 
catgtttaga 
aagaaaatgt 
tgtgtgtgta 
ctccaccaca 
tacacacagg 
agaataagaa 
nnnnnnnnnn 
nnnnnnnnnn 
aacatagctt 
ttaattttta 
gttaatgcaa 
taaaagtcat 
atgcacacat 
taaataattt 
tgctgctttt 
ttagaaaagt 
cgtcagcaca 
caactcactt 
cacctgaagg 
aagccttagc 
tatttctcga 
caaaaaaatc 
ggttaggggc 
actctgacta 
ggttgtatag 
aaagatttat 
gatcccatta 
ttggaagagc 
taaaagttta 
tgtgtttgaa 
ttgattcttt 
cagctgagct 
agaaggaatt 
gtatgatatg 
cagatcacta 
taatcaaaat 
cctgtaacct 
aggacaacag 
gaacaaaacc 
actctgggac 
agcttccacc 
ctagcggaga 
gggtgcatga 
gaagccatgg 
gtaatgaatg 
gaaacttgga 
ttaaaagaaa 
aagcatcaac 
tcgggctggt 
ttcaaatccc 
ttctggagtg 
aagtattaaa 
ttttgttcct 
tgctaaatag 
tggcctaggc 



tttttaaaat 
caaaatgtta 
tgttttttct 
tacaaaagac 
tggcaggaat 
cactccctct 
cagagtgtgg 
acgctgtttg 
agcaaatgct 
ttttggagaa 
ttgcttggga 
tgtgtgtgtg 
ctaaagtcaa 
aagcattcct 
agaagggaaa 
nnnnnnnnnn 
nnnnnnnnnn 
aaagaattct 
atgggaaaaa 
acaggacaga 
ctatttatac 
acaaacagag 
taagctctaa 
gccattcttt 
aaggagcaca 
cagcaatgtc 
ggactggtac 
ctgcagcagg 
tgcagtggaa 
agtgaatgta 
caacttttgt 
acaagctccc 
ctacacctgg 
caaattttta 
ttttatgagt 
cagatggttg 
agtcagtgct 
aaattaattc 
gtatatatgg 
cctttcactg 
atcttgtggg 
catgtaattg 
gtaagaaaaa 
tgttcaagga 
gatcccatcc 
caagtccccg 
gatatctgag 
aacaacttca 
acagtgtcac 
tgggacacaa 
ggctgcaagg 
tgtgaaactt 
ccaacaaccc 
ctatgtctaa 
cagaaaagcc 
tttcacaaaa 
taagaggagt 
gagatggctc 
aacaaccaca 
tctgaaggca 
aaatatgcaa 
gttgtttgtt 
tggaagatga 
tactctgcct 



atgaatgcat 
agaatggttg 
gtctttctaa 
aaataaatgt 
ccaacattgt 
gtaattttta 
gtgcacattc 
gatcttctga 
cctagccact 
tttgcttgta 
tacctattgc 
tacactggag 
ttctatgaac 
ctgactttct 
tgctcaaaac 
nnnnnnnnnn 
nnnnnncccc 
agagaaaaaa 
gtaatatcaa 
ttatcttggc 
gagatgaaat 
tctttttatt 
aatttaaaaa 
ctgataaaag 
tcctatgagc 
ctggaccatt 
ggggtgaggc 
ccagttccgg 
cctatgcaaa 
caggcaccat 
gagacaggct 
aaggagtatc 
cttttaggtg 
ttcaccaagc 
acagtgtggc 
tgagccacca 
cttaaccgct 
tcattatgca 
ggatgtagtt 
tgggtcctcg 
ccaccaaact 
aaatagaaaa 
ccaagaaaaa 
tggtgtccca 
agacctgcta 
ggcagcacag 
aaggagtccc 
ccatgatgaa 
acactacagc 
tgtcacacac 
gtggagggca 
acaaagaacc 
cagcactctg 
agagatgata 
aaatgtaaca 
aggcacagaa 
caggaacagt 
agcggttaag 
tggtggctca 
gctacaagtg 
taattggaat 
ttccgagcct 
ccttgaactt 
gactcaggct 



acttgtaata 
tttttgtatg 
tttttaaata 
tttaaaattt 
cttctgaaag 
aaaaatatat 
tgcagaggcc 
tgtggatgct 
aaagccatct 
ggaggatggg 
ttcttgagtg 
ctaggaatca 
ttcattaatt 
gactgtcagc 
ctgtcactct 
nnnnnnnnnn 
tattttacca 
aaaaaagtcc 
agaaatccat 
caaattaaac 
gtgaattagc 
cttaagccta 
atagtgaaac 
tcccaaaggg 
tggtgacctg 
tgtgaggagc 
tggcgatggg- 
caggttttcc 
cacacaggaa' 
ggcacgcatc 
cccttcattg 
tgctctccta 
ggttctgagg 
cagacaaccg 
tgtcttcaga 
tgtggttgct 
gagccatctc 
cgtgccacag 
atgtgacggc 
gagcgtcaat 
tatttttaaa 
gcagagttga 
acctctacaa 
gaaaagggtc 
cagagaagct 
cagagctgtg 
agtgagcgtg 
catcatgaag 
ttccacctgg 
tacagcttcc 
ggtacaaggg 
aataagttag 
gaggcaggca 
aagtcccatg 
aatgcatggt 
tgtaaggtga 
gtattttaac 
agcaccgact 
caaccatcca 
tacttacata 
tattctttgc 
gtttcatgta 
ctgattcccc 
cagtgaattt 



tattttttaa 
gtgggatagt 
ttgtgcattg 
ttactctttt 
cagctagcgc 
ttgtatgtta 
agaagagggc 
cagaatcgca 
ctccctctag 
ctacaccaag 
tgtgtgtgca 
tatccagggg 
gtctgaatcc 
cagctaagga 
ntggggnnnn 
nnnnnnnnnn 
agcattgcag 
actaatgttc 
gtctacggca 
cacagtaaca 
gtgcgcgcgc 
ccaaataact 
cccttcaggc 
tgtcataatc 
agttttacac 
cccacgctgg 
caccactttg 
aaatggcttt 
aagggcagtc 
tggaggtcag 
tcttcccacc 
agggcactgc 
atttgaacgc 
ttttttaact 
cacaccagaa 
gggaattgaa 
tccagccctt 
ggaagtgtaa 
cagaggacaa 
agggtcagca 
aagctagaag 
agagggagga 
gtaaatctag 
ctaaaacaaa 
ggggggaagg 
cccaatgatc 
tagcagaagc 
atgtgtggac 
gacaaaaagt 
acagtcagat 
aacaggggtg 
gaaagagaga 

ggagggtctc 

agagttacaa 
aggtggggaa 
aagcgagcga 
tacctttata 
gctcttccaa 
taacaaaatc 
taataagtaa 
aattctacta 
atccaaactg 
tgtctccacc 
tcaaacactg 



aaaaacactg 
acaattgtga 
ctttcatatg 
atgagtgttt 
taacttctga 
tattatgtgt 
atcagattcc 
ctcaggtcct 
tcctcattgt 
tgccaggtga 
tgcttgtgtg 
ccttttcaag 
acctactctc 
ggtgtggctt 
nnnnnnnnnn 
nnnnnnnnnn 
gataaataag 
aatgttttaa 
tattaatgtg 
cgttttgaac 
aggtgcacac 
ctttaagcag 
tatgacagaa 
tgtatccttc 
ccaagtcaca 
tcctgagcaa 
gactgtgtcc 
gaggcaccat 
attcccgttt 
atgataatct 
acacttgcca 
catcagagac 
agatcatcat 
ttttttaaaa 
gaagacatca 
ctcaggacct 
atttaacttt 
cataagcaca 
ctctgtgaag 
ggcattttac 
ttggttaaag 
gaggaggtga 
tcgtgttcca 
gcccaggaag 
agggagaggc 
tccgtgatac 
cagaggcctc 
aaaagggtgc 
cacacactac 
tatttttctt 
agtgggatcg 
tgaagacata 
catgccaaag 
agtcagagga 
aggtggggat 
ctaccagata 
aaaaaaatac 
tggtcctgag 
tgatacggtc 
ataaataaat 
tgaatggagt 
gcctggaacc 
tcccaagtgc 
cttcaacctt 
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gacccctaac 
ctggggcttg 
ctgctgctgc 
gtgcaaaccc 
tggctcttag 
tttttgtctg 
gctttcagtt 
agtaaagtca 
tcatcaacag 
gaaaacctaa 
gtttacaaag 
tggtggaaaa 
gcagggagag 
aggaggggga 
tgtatttgcc 
aattgcctac 
tggctctgaa 
caaaggcagc 
acaggcagga 
cctgctgggc 
tttggcaaaa 
aatacattca 
acagagagaa 
agcccttcct 
cagtctatgt 
tgaaggaaag 
acctatcttc 
gcacctcacg 
tctccttcca 
cctgctgagc 
agcctcattc 
gactcaattt 
gacccgagtt 
gggagcagag 
gttcagtaag 
ccgctgcagt 
ttgaagctgg 
acagtaaaca 
tttaatccca 
tctacaaagt 
aaaaaaaaaa 
aaattttcat 
aagtctccaa 
agcagtcgct 
aagaggacca 
tgggatgcgt 
ggaacaagag 
catctttaac 
agttcatgta 
ctctaattat 
agtaaccttt 
gaattccaga 
agctgcccac 
aaaaccattc 
tgcaaatcaa 
tggtcccaag 
agctatcatt 
caaacccacc 
agcctgccat 
ctctgccagg 
ggtgaggccc 
caatatttct 
tatgaataat 
acagcacaga 



ttccatctct 
ctggctggtg 
tgctgctgct 
agtcaagaga 
aggagcagac 
cgcatagtaa 
ttaatgaggt 
ctgggctcaa 
gacggtgtgt 
aagcagttca 
ggttctaccc 
tatgagttga 
atgagaacag 
ccgagcacgc 
cttcaccctt 
cactatttct 
ggctgaagcc 
taaaaaaaaa 
agtctccctt 
ctcctgaggt 
ttactcttaa 
gcacaaagtt 
aaagtgagac 
ttacagctca 
gaagcatttt 
tagaaatgag 
cagcgaagtt 
aaggtgcttg 
ccagagtcct 
cctttcccca 
tcagaaccat 
ctataaaaag 
ttgttcccac 
gtaggcagat 
ggagcctgta 
ggcatctgct 
aggtcaaggc 
tatattttac 
gcactcggga 
gagttccagg 
aaaaaaaaaa 
tttttataaa 
cttgataatc 
aagtccgggt 
gaagttcaag 
gagacacacc 
ttaagtgcgt 
atctttggct 
cttcatcatc 
atagaacata 
atctcttgca 
gcattgtaaa 
aagacctcaa 
tacaaagcag 
gctatcccag 
ccatgaggga 
cttacagtct 
aacttcccaa 
ggtagtgcac 
agttccatgc 
tgtctcaact 
cagttcctac 
aagtatatta 
tacaggacac 



tgggtcccat 
gtgtcgctgc 
gctgctgccc 
tttcaacacc 
tgcaccacgc 
agactggttc 
tctccctgaa 
tctgtgatac 
tcgtggcagg 
attgtctctc 
tttcccccaa 
aagagtctca 
agttcagaag 
ttctaaagca 
tatacaaatg 
aaaataacga 
agcctcgttt 
aaaatgtccc 
gccactgagg 
cccaactcac 
gtagctcaag 
gagaagatta 
acctatttac 
gccttcccat 
taaaatcagc 
cacacagtga 
cttgccttct 
cagctatgtg 
agggatggca 
gcctctcctt 
ttctgctgcg 
ataactgagg 
atggggcgtg 
cccaggcctc 
accacagaag 
catagccaca 
cagcataggt 
tacaattaaa 
ggcagaggca 
acagccaggg 
aaaaaaccaa 
aattacagtt 
tgaggctggg 
agtcccttgc 
gtgctcctcc 
agtaacagca 
tactcccgta 
gagaagagaa 
atgcagttaa 
agatgtggta 
aatctggtac 
tagcagcaat 
atcaatcagt 
taacgtaggc 
gaagccagat 
actagattac 
attatggcaa 
gcaagcattt 
acctgtaatc 
cagcctggtc 
cccaaaccac 
tgacagaaaa 
tacagtacct 
acacacagct 



cttacccatt 
agaggctggg 
atatcctgga 
agcagtaaat 
tgcacaccct 
catttccctt 
gccttgccca 
taccactatt 
atccattaga 
aagtgctttg 
acaaagcaac 
ggtaccgtga 
cagccgtgtc 
gcaaataaga 
tttaccaagt 
gtttatgggg 
aggctgctca 
agagctcctg 
ctctcccttc 
cgggcatgct 
gaatgagacg 
aatagaattg 
ataggagagg 
tatcactcaa 
aagagaaagg 
cctgctgcat 
ttaaaaagaa 
gagggcagag 
caaaggcacc 
ttctacagtc 
catgtctgac 
agccggcttg 
cgtgctagag 
aatagccagg 
acactgtgcc 
gctacccagg 
accataggta 
aaaacaaggc 
ggcggatttc 
ctacacagag 
aaacccaaca 
ataattttta 
ggtgtagctc 
ctgtatccca 
tcttcaggta 
actactgcct 
actgtctgca 
aacctggtat 
tttgtaaaca 
attaggaaaa 
aagagacagt 
tgcccttcaa 
ctatagctaa 
tccgtttata 
cgtaattctt 
agcaggctcc 
gtgagctctg 
tcttaacaaa 
tcagcatgtg 
tagacagttg 
tgaaaacaag 
tggacacaat 
agtttaagtc 
gcaagcgtga 



cccagactcc 
gaagggactg 
gatcaaaaca 
accttaggaa 
gtgtctgcaa 
tcaactgttt 
tttctccttc 
tccatagcaa 
agagaaagca 
gcttcaaagg 
cttattttgc 
agtgctagtg 
aaggcaacag 
cgcgccaccc 
tgaagaatgt 
tttgatttta 
cctggagtgc 
aaacactaaa 
tccaactgta 
acatgcccat 
taattgtgtt 
attatgcttt 
ggcccaggct 
ctcagacacc 
tgagttgctc 
aagacacagt 
tgtggtattt 
accatcctga 
gggcttgaca 
tccaaattac 
tgaactggat 
gttggtagag 
aggtgaggac 
cagcccaacc 
gtgtaaccac 
aggctgagcc 
gacccccatc 
cgggtgtggt 
tgagttcgag 
aaaccctgtc 
catgtaccta 
gttgaacata 
atttagtaga 
gtgcttgggg 
atgaggagtc 
ggagtgctca 
caatgatgga 
tcctcagctc 
accaatcctg 
gctactaatc 
cccaaatcaa 
ttaacgcatt 
ggaaaaatct 
ataacctgtt 
aggctctgct 
gccctcggtg 
ggcagagaaa 
cacaaagaat 
ggaggcagag 
caagatcaac 
taagacgata 
taggctgggt 
tgagcaagat 
aggatttaaa 



cattctgtgg 
cttgctgctg 
ggccaattca 
aacccacctt 
agggctcact 
agaattgaca 
aagaacctgc 
caaatcgatg 
gggtggtaag 
aaaaggagac 
aactgacagc 
aactgtcaca 
ggagataaaa 
tgtcggagtg 
taacattgta 
cttctgtgat 
ttgtagatga 
actggtgcgc 
agttctaact 
ggaattcatt 
gctggagaca 
agttcatcct 
actgacaatg 
cagcccagtt 
agtagtacac 
ttaaaaggtg 
gctggtatgt 
gggaatcggg 
gatgccttta 
ctgaggtagg 
atggtcactc 
aaaacataaa 
ggactaggca 
tgagctctaa 
ggcaagcggc 
acaagactca 
tcaaagttga 
ggcgcatacc 
gccagcctgg 
tcgaaaaacc 
ttctaacata 
atacaatgat 
gtacctgcct 
aacaggcaga 
tgagccagcc 
ctctggacca 
gacagtacgt 
gttctggcta 
gcagcaataa 
cacttaatag 
atgatggcaa 
gtaacgcagc 
ttctaaagcc 
ttgggccacc 
ggctacacac 
acctgctcat 
aattcacaaa 
aaataaatag 
gcaggcagat 
aacgctatat 
tggatcaatg 
ataattccaa 
attatcgcca 
ggcacccatc 
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cctacagtat 

catttttgct 

aattcttact 

gcgctggact 

ctttgctgtg 

acaatactcc 

aaaaaagcac 

ctcacagaaa 

aacaagtcca 

gacagacctc 

taaagacaca 

gctggggtac 

cattcccagt 

gtgtacccag 

ttgggttcaa 

aaccacagag 

gggagggggg 

gtaaacaagt 

aattttttta 

ccatgtgggg 

taaagaaacc 

taggagccca 

aattaaaaca 

gttaagagca 

agctaacgcc 

gcccgcaaac 

gattctttca 

ctctgcagac 

ggaattaaag 

aaagcgtttt 

acataattaa 

gattcactca 

atgaagactt 

tctttgtgtc 

cttcctacct 

tggactacaa 

tataagtgtg 

caatattagg 

cttgaagcaa 

acatacacac 

gcggtgagga 

gacaagccag 

atgccaggga 

agaagctgta 

gatatgggtc 

agcacacacc 

tggtccacca 

gtggctgcaa 

agaactggga 

acagaaggag 

aaactggtgc 

aaaaagacta 

gaagcaagcc 

tgttaaaaaa 

aggaggcaga 

agtgggttcc 

agagaggaag 

cagcgttatt 

aagatgaaaa 

gctcattctt 

gactagagtg 

aactggagtt 

aggcctgaag 

ttacaaacag 



atgcagcatt 

caatatggca 

agcctgggct 

agcttccaga 

gtgaaaggtg 

tggcttggta 

agagctcaca 

tcaagaagaa 

cttcagaggc 

actcagaaca 

gaagatgggc 

gctgtaacaa 

taacccagca 

gaggtaattc 

attaggatca 

ttggttaaat 

ggagggaagg 

ttcaggctca 

aagaggacag 

tacaggagca 

ttaaactgtg 

cccattgcac 

aacattaatg 

cagcgcccat 

ctcttttggc 

attcaaagaa 

tttttttgag 

caggctggcc 

gtatttaaca 

gttgtttttt 

aaatgccaat 

cacaacttat 

ctggatacat 

agacattagt 

attgcagtgg 

ggcagcaagt 

tgtgggcact 

catcaagagc 

aagaaactca 

acacacacac 

actgtaaaaa 

ggcctctgcc 

ccactctaac 

gagagttgat 

agccatgctc 

ggacctggca 

tagtggctgc 

agaggacaaa 

agtacccgct 

agactggtta 

ccatcactac 

cagagtacgc 

agactggaga 

aaaaaaaaga 

ggcaggtgag 

aggaaagcca 

tggatccaag 

aaatgacagc 

aataaacaca 

gttatccaaa 

agacactgct 

tgagggaggg 

cccgaaggtt 

tttaaacaca 



gactgtctag 

gcagtaataa 

gtgtttgctc 

ggatgtaaat 

ctgtttctga 

acaatgccaa 

tctgagccaa 

aaacaagcaa 

caaaaaccaa 

caagctatag 

tgctggcatg 

gcagaggaga 

atcacttctg 

actgcggctt 

agcacacttt 

aaacaacaga 

gggagaagct 

gtcggggtat 

aaaaaagaca 

gcatttgcat 

ttatctataa 

gccatagttt 

cttgttaaag 

atggaggctc 

cttcaagagc 

aaatacaaac 

acagggtttc 

tccaacccag 

cacacattat 

gttaagtttc 

gtgaaataaa 

atacttaaat 

tagaaacgta 

gtgagtgtct 

ttctcgctaa 

cccagcaatc 

agccttgtta 

tcttaactgc 

cagatggtca 

tcacacacca 

taaggtgtca 

ctcatcagct 

atggagctgg 

ttgatggatt 

tagtgagtgg 

agtcaataaa 

tcgtggttgg 

agtctctgga 

gggacagaag 

tcggaggaaa 

aacagcagta 

accctggcta 

aaaggaaaca 

aagccagaca 

gcaggaggat 

ggcaacaaag 

agcaagagag 

agaaaagatg 

agttctaaat 

caaactaaag 

ttaaaaaagc 

agtgggcccg 

gaactgagaa 

ataccagcaa 



ttttatttcg 

aatgtatatt 

actaactcca 

ctaactttgg 

agccacgaca 

aaaataccaa 

aaaaaccgac 

acaaacagca 

aaggcaaagg 

ccatgatact 

actgggaaga 

atgtctaact 

gcaaaaaaag 

ctggtggatc 

tatagtcaga 

acagccacac 

agaatcccag 

aaggtaagac 

gcgtagagaa 

ttggttactt 

agggcagcag 

tgattacaaa 

aaacagggct 

tcagccatct 

actgcatgca 

tgctataaaa 

tctgtgtagc 

agatctgcct 

acatatcttc 

aattaaaaaa 

taatatatac 

acaattttca 

ccctgaaaat 

aaacttgcat 

acctggaacc 

ctcctgtctc 

catggctgct 

taagccatct 

gagtttacac 

aggcttagtg 

ggaaagctct 

cttagtaact 

gatgggctcg 

ctaaagtagg 

ccccacaccc 

acaaaaacaa 

gagctgtccc 

actgatcaac 

cagactctga 

tctgaaacat 

attgaattgg 

tcatcaaccc 

aaaagttatt 

tgggggtgca 

ccatgagttc 

aaaccctgtc 

agagagcaga 

ggcccgacca 

at'cattttaa 

caggggtggc 

aatagatgca 

ggaagcctgt 

catctcaaga 

taaattgtca 



cacactttga 

tgcattttga 

gtggacttct 

ttctcggcct 

gtcccatggt 

aaaaaccaaa 

actccctatt 

cataacaggc 

ggctcttaaa 

gctcttca'tc 

gaccagtgtg 

gtgtctgcca 

cccactctgc 

ttttttcctt 

gcacggagac 

cagaactgct 

cattcaggat 

tttatttcac 

ctagctatct 

ggcgaatagc 

tgggtgcaca 

gtcacgcagc 

gcagagatga 

gcttctccaa 

catggtacgc 

cccatatatg 

cctggctgtt 

gcctctgcct 

cttcttcttc 

ctacatagtt 

atatataaca 

caataatgaa 

cgcaaatgac 

aaaggctctc 

tggggctcct 

acctttcttg 

gggactagaa 

ttctaccctg 

acacacacac 

accactgtga 

gcacaaaatg 

atggttgcct 

agagjgccctg 

gggaatcgat 

acccatgagt 

aaacaaacaa 

gcacttatgg 

tctagagtct 

aggtgatcag 

aactcgacgc 

gcacaacatt 

aggcgattct 

taacaaaact 

cgtttttaac 

gaggccagcc 

tcaaaaatca 

gagagtgggg 

atgacatccc 

taatggctgt 

cgggatttac 

cgctaaccat 

gcctataaac 

caaagcacag 

gctttatgac 



atcatccatc 

tgcatggtgg 

gacgtaagag 

cccctgaagc 

ggtttgagta 

accaaaacca 

ttttgaagaa 

taacaacaac 

catttacaag 

catcaattct 

ctcactcatt 

atcacaggca 

ccgcacactg 

cttgcagtgc 

aaaaggatct 

ggggggcggg 

gcagaggttg 

aaaaataaat 

tttatgtaaa 

cctggtagga 

atgcagtggg 

tctactcaaa 

tggctcagtg 

ttccagggga 

ttacctacat 

accatcttaa 

ccagaacttg 

ccacagtgct 

ttcttttttt 

ttataggcaa 

ttctgtaata 

aagctttgaa 

ggttttcatt 

ttctctatca 

gttccttggc 

gaaccaatgt 

ctctggtctt 

attagaattt 

acacacacac 

aaagggaagt 

gtgtcctctg 

acagcaaacc 

ttattaaagg 

ttcctttatc 

atgtgtggac 

ataaacgttc 

caggaagaca 

gcttgttatg 

gacagagatc 

atactggtcc 

cagaaaacag 

ggcactattg 

tcccagagca 

cctagcactc 

tggtctgtac 

ctgactgggg 

gtttggatgt 

agaaatggca 

gtgtctgctg 

ggccagccag 

taatcagcgt 

ccaacctgcg 

gcacaatctc 

agataggctg 
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acaggcatac 
cgacacagct 
cacagcatgg 
tggaaaaggg 
agggcccagc 
gtacaactat 
gaagcacaaa 
agttgagaca 
ctctttacaa 
gccagaggcg 
ggtccaggga 
ctacaggagc 
tcagggggct 
gagttcagtt 
taaacatttc 
aaaaaatctc 
agcaaagcaa 
aaaccccagg 
accgaaacag 
accattcact 
cgatgaacag 
gatagctgcc 
gatccaaggt 
ggaggagggg 
tgctggcctc 
ctcacccagt 
acaaagaaag 
gatgtaggct 
tgctatcacc 
tcctcagcta 
cacatctgta 
acttaaaaat 
nnnnnnnnnn 
nnnnnnnnnn 
tgtcatcttg 
attttattaa 
ctgtgccagg 
gtaacatggc 
gcttacttat 
agcgatggga 
ctaacctcag 
agcagagccc 
ggcgggaaga 
ttgtgtgtgt 
tgcatgtgtg 
catacaatgg 
gaacacattt 
caggaaatgg 
ctttcctggc 
caggcaccag 
aaataaaaac 
gtggatattt 
caaaaactgg 
ggtccaaagt 
tcttctggag 
aaaaaaaata 
agggaaagcc 
aggatcacag 
agaccccggt 
gccacaaaac 
tggtaatttt 
aactcactct 
tggattttga 
ccaaagaaaa 



cacaaagatc 
gagcctagtg 
ccagagtgac 
aagaaggcaa 
atccctctcc 
aaatactcct 
cataacactg 
tcaaacacac 
agtctccaca 
gcggcacaat 
gcactagtga 
ctagacaccc 
gatgagatgg 
cccagaaccc 
tatcctccaa 
tcgtggacag 
atccaaatat 
tcgctttagc 
aaaaggcatt 
cttaataaaa 
gaacacaaac 
agactgggct 
gctgcctccc 
tttgacacag 
aaaactcttc 
tttaatgcca 
atcacacaga 
ggtctcaaac 
catggtggca 
ccatgggtgt 
accccaccac 
aaataaaaac 
nnnnnnnnnn 
nnnnnnnnnn 
aagggttgaa 
aaaacaaaac 
gaacccactg 
tgcttacgca 
aatatcaaac 
aaaataacct 
ctgcagactt 
tgtgcctgtg 
gatggaaagt 
gtgtgtgtgt 
tgtgtgtgta 
ctatgcttat 
ctaaaatttg 
gtaggagagg 
accaactagg 
gcaggaaagc 
ggagcagaag 
tgtgagttcc 
gggctggtga 
tcaaatccca 
tgtctgaaga 
cctttaaaaa 
acactcgatg 
cgggacggag 
ctcaaaacaa 
accttcttct 
ttgtttgctt 
gtagaccagg 
tttgggggtg 
aacacccccc 



gggaagaaga 
acaggccgtg 
agggtcacag 
agagcaccac 
cacctgacag 
tcctagtgta 
taagttacca 
gtttgatcct 
tttataaacc 
ggtggctgag 
attcaagtaa 
tgtaaggtca 
ctcagtggct 
gtatcaggca 
gcacagacac 
acagcaaaaa 
gcgaaaagga 
atccaaaaaa 
tgattattaa 
atgttaccaa 
tggctgggca 
ggcatgagaa 
ccccccccct 
ggtctcactc 
tgcctcagtt 
tttaacagta 
caaacatcac 
ttgcacaaga 
tttcggaagc 
gaggccagcc 
tcaggaggct 
ataaataagg 
nnnnnnnnnn 
nnggaaaaca 
atcactgagg 
tccttctgga 
ccagcagaca 
gacaccacac 
ataagtacca 
gcacatggtc 
gtgggtactg 
ccagggaacg 
ggagctaggg 
gtgtgtgtat 
tgtatgtatg 
gtgtctttta 
gaatacttgt 
caggagagat 
tcactcacaa 
ggccccacat 
ctgggcctgg 
gggtctacat 
gatggctcag 
gcaaccacat 
cagcaacagt 
aacaaaacaa 
gcgcaagcgt 
gccagcctca 
aacaaaacaa 
tcccaatcct 
ttcaaggcag 
ctggcctcta 
tgtgggtgat 
ccccaaagta 



aacgggattc 
ataagctcaa 
caatacagca 
gagagaaaag 
agaaaaccag 
gacagagttt 
gaagtttaga 
aggggaatta 
aaatacactt 
gcagcaggat 
acattaaaca 
gtactactaa 
aagggcatgc 
gttcgcccac 
acataaacat 
atactgaacg 
taatacaatg 
ttcaatcatc 
taaatacagg 
acaggagaaa 
cagtggcaca 
ccaagggtaa 
gtgtcactta 
tgtggcacag 
ccagagtgag 
caaggttaaa 
acaggtgctg 
aaagaatact 
agaggcagga 
aaggcagcac 
gaagcaggaa 
ttggagagag 
nnnnnnnnnn 
cccattagac 
aatggctaat 
ggagtactgt 
cccaacttca 
acacttcctg 
tgtgagcgct 
agtaatgggc 
agaacggacg 
gaggacgctg 
cccgatgacg 
gtatgtgtat 
tgtgtgtatc 
ccacaagtaa 
tggggaagct 
ggcttagtgg 
ccaggatctg 
ccctgcaggt 
aagtccttta 
agtaaaaact 
tgggtaagag 
ggtggctcac 
gtacttacat 
aaactcccac 
gtgatcacag 
gctacacagg 
aaaaagcgtt 
tgaatgctat 
ggtttctctg 
actcacagag 
accacaggga 
tgaacatcaa 



ttgcacaaca 
aacatgcaca 
acagagacag 
ttaacccgcg 
gaggacgcag 
accaaagggt 
cacgttgaga 
aattattggt 
gtatcagaat 
caagtctgga 
ttaaataaga 
gataccaaaa 
actgctcttg 
ctgtaatgta 
aattaaaaat 
aaatctaatc 
caaatccaga 
ataattcacc 
gaaagcattt 
gaagtaaaat 
tcctgaaacc 
gaggcaccca 
aatccgtaat 
gctagcctga 
gggaaacaat 
cgctgcctga 
tttgagacta 
"tttgccatca 
agatcagtag 
gtgagccagc 
gttcaagcca 
ggnnnnnnnn 
nnnnnnnnnn 
tgctgaagag 
gacaaaactg 
gctcccttca 
aggataatca 
tatgctgggc 
acagggcctg 
aatccactcc 
tgactaagtc 
ctcaatgggc 
tgaatatggc 
gtggtgtgta 
caaaatggtt 
aaaattttaa 
atctgagccc 
ttaacacaca 
acagcctctc 
aaaacatttg 
cctccagcac 
tgtcgccaag 
cacccgactg 
aaccatccgc 
atattaataa 
taaaataatt 
tattctgtag 
ccggtctggg 
cacattatca 
cgaatttggt 
tgtagccctg 
atgtaattct 
cacttgggcc 
ctgtatataa 



ttttacaaat 
ctgcaaacac 
taagaaacat 
attccatatg 
ctgaaacact 
atcgtaatct 
atttaataga 
aacagaaaag 
tgttgtctga 
gaagcccctg 
aagacggatt 
ccaaactgac 
cagaggacct 
ccagagaccc 
aaatcttaaa 
aggtagactc 
cttatcccag 
gtattagcta 
gacaacattg 
ctcctcaacc 
ccagcaccca 
aacacttaat 
taaatttctg 
atggagtcaa 
catgagccac 
aaagaacaag 
ggtctctagt 
agatcaaggt 
tacaaggtca 
tgtggaagca 
cacaagagcc 
nnnnnnnnnn 
nnnnnnnnnn 
actgagaaat 
cttaaggaga 
gcattaacta 
agcttgctac 
atcacctcta 
gccaatgaac 
ccagacactt 
aattcacaga 
agagcttcag 
taatgccata 
tgtatgtgtg 
ataaaaatcc 
gtaatcttag 
cagcatgaat 
catggtggcc 
ctggcctcct 
tactgaaact 
cgggaagcag 
taaaacaaaa 
ctcttccaaa 
aacaagatcc 
ataaatcttt 
ggaaaagtcc 
gtaaggcaag 
ctacagtgtg 
tatactcaag 
tacttttttt 
gctatcctgg 
ttttcaaagc 
ccaaaccaaa 
aactaacagt 
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tcatataagc 
tctttcagta 
atgtaaacga 
ggtcagggaa 
cgtcctcacc 
tgcccaagcc 
actttaaaag 
aagaccttgt 
acaatggcaa 
agggaaggac 
actagtctat 
ctctgcatta 
tgttcatttg 
aaccgcatgg 
ggaactcggg 
cccaattcac 
caggctagct 
gtttccaccc 
ttcacaccaa 
atatgcactt 
tttactaggc 
accagctctg 
tctgtgcaat 
ggtaagtcct 
tggcataagg 
tccccaaggt 
acccactggt 
agcgaataaa 
taaagtaact 
taacggttcc 
acagtatcta 
taggtgattc 
ggccgtgcta 
gcttaaaatg 
ccaagtaaac 
ggcttcgcac 
cctcccttat 
aatctgcccg 
ttaccccgta 
gctgcccagc 
ggcctatctg 
acaccgtagg 
tgggaaattc 
tcttattgca 
ccaccgctgc 
aagcaacaca 
gtgtacatcc 
aaatcacacg 
ctttttccat 
ctctgtctct 
tctccggaac 
cctcccaagc 
tatagttatt 
gtgtgcagtg 
cctactgtgg 
tgagttatct 
tcatataaac 
cccctcacca 
tactcactga 
tctttaggga 
agccattcag 
agacaacctg 
ctcacaggct 
atgactaaga 



taagtagcct 
gatgccattt 
aaatatgtta 
gcacggctgc 
agcaccaaga 
tgaaaatctg 
tcaattccag 
gggtgattta 
aagctcctcc 
actatacaca 
tttttttttt 
aataaaccta 
ttttgactgt 
agcaggtctt 
tggtcagatt 
ctgtttactc 
ttgaactcac 
aagtgctggg 
agtggcttcc 
gcgtttctct 
tgaagcacac 
atgcaaaaac 
ggcattccct 
cactcagtgc 
ccagcattat 
acatagtgtt 
tctcttgaaa 
taacaaaagg 
tccaagggag 
cccttatgag 
atgaattaca 
tgaccaagcg 
gccctgctac 
tggtcgaact 
ttccgcagat 
ggcaagtttt 
caatatgctt 
catacttact 
acaagtgctg 
agggcaaagt 
atttcctcag 
cttctcacaa 
ccggtcacac 
gtcctgggtt 
cacagtgctg 
ggtctggaac 
tagctcccac 
tgctacacgc 
tttggagacg 
atagagtgcc 
tcactctgta 
gctaggatta 
atttgttaca 
tgcacatgtg 
cgtttgggga 
cgacactgac 
cacagcgtgc 
tgtgagtgag 
gttcccttgc 
tcagtaagtt 
tcagtctcta 
aagacggaag 
tgacaggcaa 
tcttctaaat 



gcaggtacat 
gagaaaaaaa 
aatcttaaaa 
aggcccccag 
acagccaact 
atatgaaaac 
ctatctgtgt 
aacagcgtgg 
tccgtggcct 
ggtaccaatc 
accttagcta 
gtgtataaga 
gtggtgtgca 
ctccttccac 
tgcttggcaa 
taatgtatat 
tgtggacact 
acaacagaca 
tatgaccttc 
ttttttactt 
ttcaggagac 
aaaaaacagc 
gacgcattac 
gcgcatccac 
cacagcgcgg 
ctgtccaagg 
atctgcaata 
aagacacccc 
aaatgacatt 
gaagtattca 
ggagccactc 
taggctatgc 
atgcaggtaa 
caatctgcta 
gtcggagcat 
taatgagcct 
gacatctttt 
gggcagccac 
gggttacaga 
cttccttatt 
cacaccccca 
tccacactac 
ttctcacagc 
gggcaggcct 
aaatagtccg 
aggcaactca 
gctcacgagc 
agaacccaaa 
gcggctcact 
tggctaccac 
gaccaggctg 
aaggcatgtg 
tatgtgtatc 
tgcaggtcag 
actcaggttc 
atgtaatgat 
atgtgcaggc 
ggagaggaac 
tggccaagca 
tttccccaag 
ctgccactcc 
gtgagccaat 
ttcttggact 
ggaaaaatgt 



ggttatggaa 
aaaaaacaaa 
aaccttaaag 
agcagcacat 
ataataagct 
agagggcaag 
ggaaggactt 
agggcagaaa 
ctacacgcat 
cccacgaact 
gtgtctttct 
aggcaaatga 
tgccaggcgc 
caccccaggg 
gggcttcact 
atttttgaga 
agctggcctg 
gcaaccatca 
tcctgacaac 
caagctttga 
accctgtagc 
gatggggatg 
ctcacaccag 
cacacccaaa 
acaaacacaa 
atggcctctg 
tttgcctaaa 
aaggacgagg 
gccgccacac 
cttctggtca 
aataccttta 
aaatgctctg 
tgctacgcca 
gaggtgtgag 
cagacagagc 
ccatggtaat 
agttttttga 
agctggccct 
catatgccgc 
tacaaagcag 
agggtctcac 
tcaccgataa 
ggtacaaccc 
ggtacataca 
cctgcaggcg 
aagctgctct 
gatatgcaga 
gaagtaacaa 
agctagcctt 
acctgggcac 
gcctcaaact 
ccatcagcgc 
agtgtgtatg 
aggacaactc 
caagatagca 
tcagtgtgca 
taggaaacaa 
tcaggctgtc 
ttatttataa 
acaagccaga 
tcaacctgtc 
aaaatcttac 
gaatgcgttt 
ctggttaagt 



acagctttat 
acattctttg 
cagacgccac 
tccctgaggg 
cataaaaaat 
acagataaaa 
ctcgggacgg 
aagaaaggaa 
ccaagcgata 
agaaaacaca 
tatgtttggc 
gtaaaggtaa 
attcgtggag 
gtcgcagggg 
cactgaggcc 
cagggtcttg 
gaggtctgag 
aggcaaaatg 
cccaaacagc 
ctccactttc 
tctggagtga 
gggcaaaggt 
atacatacag 
gcaagcattc 
agccaacctt 
cagactcctg 
aaacacaact 
gaagattatc 
acagcccccg 
ccccgtgttt 
tcataaagtg 
agcacactta 
gtcttacgcc 
tttagacctc 
tgtcccctcc 
agtgggttcc 
gacagcaaat 
gaactcctcc 
catgttcagt 
cagccaagcc 
cgatggcagt 
gatcatgcgg 
attctgctgg 
gttctctttg 
aaaggaagac 
tccttgcaag 
atcactaact 
accggcactc 
gaactcagag 
tttgttttcg 
cagaaatcca 
ctgacccaat 
acgtccatat 
tcaggagtca 
ggaaaagtgc 
cacagtggtt 
cctgtgggag 
aggctcggtg 
gatggtgatg 
aaaacctttt 
tgtggggcag 
tgacagtaac 
aagagaatgc 
aacccagcaa 



catacagact 
gattttcaac 
tgctctttgg 
aagccttgtg 
cctgaaaggc 
ggcaaactat 
actatctgct 
agtgcaggaa 
tggagggagg 
gcttttacta 
ataatttctg 
agtcagttaa 
gtcagaggac 
tcccagggat 
ccccaagggt 
ctatgtagtc 
tgaacctcct 
aagtcttccc 
aagtgcccgg 
ctaaaaggtg 
ccggaaacac 
ggcttgtgct 
aaaagacaaa 
aacagctcag 
ctctttgagt 
gaaacacgtc 
ctattatttc 
ttttcctact 
actctttggt 
cacttttagt 
ggctttgtgt 
ggggaggctg 
acaggaagaa 
gggagacggg 
tgatgcaaaa 
tctctctcct 
aacgtagccc 
tgcgtctcct 
gtaagtgacc 
ctgcagccca 
cagtccatga 
tatttgaaat 
tcaatcactt 
cggagaaaca 
cgccatcagc 

tggtgagcgc 

ctggtttcag 
acgcacctca 
tttggtctgc 
agacagggtt 
cctgcctctg 
tctttttatt 
gtatatgact 
gttctctcct 
ctttaacagc 
ttgcatgtag 
ttggtgatct 
gcagtgcctt 
tctaccctta 
aggcccaatg 
gacaagccac 
agccagccag 
agaattaccc 
ggagctaagt 
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cacgcaagcg 
aatcatgtca 
cctgcccccg 
tatacagacc 
aggcatttgc 
catttaaaga 
agtggaagag 
cgaatacata 
agctggttgc 
ccaggggatc 
tgcaggatgc 
gtggatttga 
tggaacatgc 
gaaatagctt 
tcaaagccca 
tccgttgtct 
gatgtttaat 
ggatcgagca 
aataatggag 
tccaagagaa 
cagtggttaa 
gtggtggctc 
ggctacagtg 
aggaagagga 
gccagtctgt 
gctacattat 
caggcctgag 
aactcaagcc 
cctggcccag 
cggcccctcc 
caaacctagg 
tgtgctgtga 
cacctagtcc 
ccctcctcct 
aactgggctc 
cccctccccc 
aagggacatt 
gcccaatcag 
ctcagtgaga 
taaaattgac 
gaaggtagca 
ctacacgatt 
ccaacagagt 
aatccgtctc 
gacggctcat 
ccactcttag 
gaacttgtag 
tgtgatcaca 
ggggttgtgc 
gtctctggtc 
agacccacac 
taaatattgt 
ctgtttttct 
ggtactacag 
ctcaatcctt 
cttaaaaagc 
tgggaatgac 
cagtgacaac 
attctacccc 
cagcgcgcag 
taattgcttt 
ggctttcagg 
ctcaatagca 
ctctggttta 



gtggatacct 
agtacctaca 
gacccccacc 
agctgttatc 
cactatgcct 
gacgaagcta 
cttgcttagc 
agaaggctta 
ttaatttccc 
tgatgtcctc 
ctcgggccac 
gtttggtccc 
atgctctacc 
ttaggaggat 
atttaaaagt 
gtctttagtt 
tctcaagatg 
cacagtgaaa 
cagtggcgag 
cgcaggttag 
gagcactgac 
acaaccatcc 
tacttacata 
ccactgagca 
ggcatggcag 
gtccctctac 
gtcccaggta 
cctgccgttc 
gttctcagat 
actcctgtgg 
ggagaaagaa 
atgcaccagc 
actcaagaaa 
ccacccgcaa 
acattgacag 
caccctcggc 
tttgtttcat 
ttatagaact 
atttgtttat 
tatacttctt 
aggacaggct 
ggtggtaact 
agcggtggcc 
accactacta 
caggagtgaa 
agccaagatc 
atcaatgaca 
gttactgaca 
tcagagtctg 
agacttctgt 
cactgaggat 
tattggttta 
tgagtctctt 
ctattcaatt 
ctacacaaca 
tagatatggt 
aatgtacagg 
acaacagaag 
cacatcagct 
ctgttaaagc 
ctctggctct 
ccaccatctc 
aaggcttggc 
ccagagcccg 



gctttgcctc 
aaatccagat 
cggggtctca 
aaattcagag 
ggctctcact 
gaaaccagtt 
atgcacaagc 
gaggaggggg 
agtccccaca 
ttctgacctc 
tatgctggct 
tgggacccac 
cgcaaattaa 
gcctaaggaa 
ttgagtgctg 
taagagagca 
tcctgtgata 
tgagacagtg 
cgatattttg 
accaaggtag 
tgctcttctg 
gtaataatat 
taataaataa 
cacattgaaa 
caggaattac 
actggccgtg 
aggacacagc 
tctgggtgtc 
ttagcctcat 
tcagagatgg 
acaaacgtac 
cccgagctgc 
tgcaatgccg 
ggtcagacaa 
cagccatgaa 
ctcaagcctt 
gagtctgcca 
caatcagaag 
ggaaatatca 
gaccatctat 
agtggcagag 
aaagctgccc 
caaggcaagc 
gccgctcaag 
cagagccagg 
taaagatcgc 
gtattttgtt 
gtgaaatatg 
ggagaaaaac 
cctacaatgc 
gccttaagtg 
catcagactc 
aaattctacg 
tccttaagct 
ccctcaggat 
gtggaggttc 
caacaaagag 
atgtgaccag 
gcccacagtg 
cttgccgtac 
ctatgttcat 
agaagacata 
aaaacagctt 
agcacagata 



tgaaccctgc 
ctagcctgaa 
ctgtgtagtc 
acccacttgc 
ggttgtacca 
tagacagctt 
tgttggtttt 
tggtggtgga 
tggtggctca 
cttggggaca 
caggggtgaa 
aaggaagagt 
aaacttaaaa 
cttctctgcg 
tcacgtgttc 
actgggtgag 
gcattataag 
aggaagaaaa 
ggaatacaat 
gaagaggatc 
aaggtcatga 
ctgatgccct 
ataaatcttt 
tggaagatgc 
agtctgttca 
actgaaaagc 
cctgacctgg 
tggagcgagg 
gcaaagtttt 
aaacaccatc 
tgctgacagg 
gccccctcgg 
ttgctccttc 
actgccaaag 
caaaggcagc 
caccccaatc 
aaaatgtttt 
gcattatctg 
aggttaaatt 
tggctaatct 
gacagtgagc 
ccacacagga 
agtcacatcc 
tttaacttca 
agacaggcag 
taatccttgc 
cgggggacag 
aacaccttaa 
caaccaacaa 
aacgctagct 
acagacaccc 
aaaacaaaaa 
acacagctgg 
tgggctaggt 
gaagtgctct 
taacaagctc 
tccattgttc 
ctccagagac 
caaactgaag 
ctggcgggat 
caggggatcg 
atgcctcgaa 
ggcagcagtc 
acccctgagc 



acaggttttg 
ttcaaagata 
ctggctgtcg 
ctcccaagtg 
gaggagcaaa 

tggggctgtg 

aaccctcagc 
gagatgggtt 
caacatccat 
tgcgactcat 
ggtgcttgct 
ttgacttcta 
tttaaagagg 
ttttcaggtg 
tgtatgccca 
aagtaactga 
ttgctgtgga 
cctatattgt 
agaaataaat 
actgggctgg 
gttcaaatcc 
cttctggagt 
aattaaaaaa 
aactgaaatg 
aaacccagca 
agcaacgtgg 
cactaggaaa 
ggtgcgaggt 
ccagttggtg 
taaggcttgt 
aacttcacat 
ctctcaccta 
catctcgtgt 
taccacagac 
aacagcaagc 
accaaaatga 
catgggagaa 
ctgtgttaga 
catttatatg 
gaaggaaaag 
ctgagatgaa 
ctgactccct 
gtcagaggac 
cacaggcatc 
cctctcttat 
ctgggtaggg 
acatgcctgt 
tgtagacaca 
accagaccaa 
atcagcacag 
cagatggatg 
tgaggagcgt 
aaaccacaca 
aaagtatttt 
tccaaggcag 
tcctcatctc 
tcactgtact 
tcagcagagt 
atgctctcag 
caaggtcagc 
tgggtttgtt 
gaagtatagg 
cacatgctga 
tgagttgcac 



gtttatgttc 
ggctacctac 
tagcgctctc 
ttgggattaa 
acaatgtggc 
ggtgtagctc 
gtgacagaac 
agaggttaag 
aactgcagtt 
ttggctcaca 
gccaagctgg 
tacactgagg 
aagctgtaga 
agattcagac 
gttctggctg 
gagtctagcc 
tgacagtgat 
atacgaggac 
gattcaaata 
agagatggct 
cagcaaccac 
gtctgaagac 
aaaaaaaaaa 
caataccaga 
cggacctgag 
taccctggag 
caggtgtaaa 
accttgtctc 
ttgccccctc 
ccttctgagt 
ccttctcaaa 
ggtctcagca 
ccttcaatgc 
cctttcccta 
cacactggca 
accagatgaa 
aatcttttaa 
gatttgcatt 
aaattattat 
aggccaggga 
aacacctcaa 
aaggagactc 
aaggtccata 
ggcaaccaaa 
agtaaaaggt 
ggggtatact 
ttgtgatttc 
caattaacct 
tgccatggct 
ttacacagcc 
ctctaagtag 
gaaatatgag 
tgcccaccct 
ctttgaccac 
agtttaatgc 
ctgagtgctt 
ttaccgagtc 
aagaaagtac 
cactgttctt 
acttagattc 
ggtttaagag 
aaccactcca 
cagtgtccag 
aaaacctacc 
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agccacgaag 
ggaccgagag 
gctgccgaga 
aagaagaatg 
acagacctcc 
cacacagcga 
ccctcaggac 
accttccctg 
ttcactgtga 
aagaacactt 
gcaagcacgc 
ctcaaccctg 
tgtgaaatca 
ctcattattg 
cttcagggcg 
ctcagtccat 
agttcagcct 
gaaagaactt 
agggagagca 
ttttgacagt 
gctgcagaca 
aaacaaaagc 
gcctcgagat 
atcccggaac 
cacacgtgtg 
aggaatagct 
gaagcccaca 
cctaagtgac 
gctttcaagc 
aacttaacta 
tgaatctgcc 
ggggagagag 
tgttttacca 
gaggtactgg 
agaattgagc 
ctggcccctg 
ccgcagacta 
ctcggcaggt 
agtctctgca 
tttaggagca 
cagaaggaaa 
nnnnnnnnnn 
nnnnnnnnnn 
atctctatat 
aatcccattt 
tctagaagag 
ggcagcttat 
ataagcgcag 
atggttgtga 
tgccttccac 
cttagtactt 
ttctatcatt 
gcgtgaagta 
cccctgtgtc 
tgtgtgcaca 
ccttgaaata 
gatgagatgt 
cccctcttct 
gcagtgctgt 
aggcaaagtg 
tttgtttgtt 
tagaattcta 
gcaagattct 
caggtattct 



cttataggcc 
gctccgtcca 
gccgtcaaac 
tacaggatgg 
cactcgggaa 
aggtcttcaa 
tcggctcagt 
acatgtccat 
ttttaatcga 
tgtaagagct 
tagtcacgca 
gacgcaccta 
aactgaactg 
aaggtcatct 
ccctttgaga 
caaagcccgc 
gtcagctctg 
gtccacttgc 
cattagaact 
tagacaccat 
ttgcagaatc 
agctcagcca 
gatctggaag 
cctcgggatg 
catgcataca 
taatattccg 
ggaaggcgtg 
ggcttacttc 
tgactgcact 
aatggctaaa 
atcacagtta 
ccccaaagct 
gcatgtatgt 
atcactgaca 
ccaagtcctc 
cattttgctt 
gttaaagctg 
atcaccccag 
gacggagact 
ctttatctca 
gccacaatga 
nnnnnnnnnn 
nnnnnnnnnn 
atgaatgagt 
acaggtggtg 
caatcagtgc 
ttttaattgt 
gcactcatag 
gccatctgat 
ctagaattac 
tcaacatagt 
ctctacaccc 
tgagttagag 
tctgagacag 
gcgctctcaa 
acagatttct 
ttttgaagga 
acagagaaga 
actctaaggt 
cagactcata 
tgtttgtttc 
ttgttgatta 
tcccatattc 
cactgagctg 



tctgggatgt 
ccgaagtcag 
ttgtcagcct 
ttttatggag 
ggagcattgc 
atacttactg 
gcaaggactc 
caaatagata 
atcttcatag 
cccatcaagg 
tcaccccaaa 
ctagtgcgtc 
tgggtaagtg 
catagtcatt 
agtaatacac 
cccacaatgc 
actcaacacc 
tcattgaaag 
gcgctgctag 
cctcggagca 
ccacatatac 
cccgagccta 
acaaaggtgt 
gagaagacaa 
cacgaaaata 
ataatgaaac 
ctgaactcca 
acagtcagga 
gttacttttc 
accttaccca 
agatatctgc 
ttgcccttcc 
gtgagcatca 
cgggagtcac 
tcaaagaaca 
taagtaaagc 
agtagctgct 
ggcttcaact 
atttacagtc 
tgtctgaccc 
ccgccatgtt 
nnnnnnnnnn 
nnnnnnnnnn 
atcctgtagt 
gtgagccatc 
tcttaacccc 
tttaaatcac 
aggtcagaga 
atgagcacca 
agtatatgca 
gtaaatcgga 
aggcttcatt 
tcagacaatc 
gggaggattc 
ccatttgttg 
ccattttgtt 
actttctagt 
aggaaacagg 
aacgtgcgtc 
accagtccat 
ttcttcttta 
tattcacaaa 
cttcagaccc 
tacagcattt 



caggattcac 
agactcgggc 
gcaaatcaaa 
aagagtcagt 
actacaagaa 
gatagcacaa 
acatcttctc 
tttctcttag 
ctcttgcaga 
ctcctcttta 
cactggagac 
tttagccttg 
atggccatct 
tttaacagcg 
ctgcaaaaga 
agccttccaa 
gtcttcttac 
cagtagctct 
gccctcgctc 
acagtccaag 
agttacattt 
cagttgtaag 
ttcctgtgcc 
tcgccccacc 
attcttacat 
ctaatattct 
atgattcagc 
gctggtgcca 
tgtcaggcaa 
gacaagctgc 
tgtgctacac 
cggtatcatg 
ctgagtgtct 
agatggttat 
gcgagtgctc 
ctagtgagta 
gctccttctg 
gtgcccctag 
caccaatctg 
tgtgccccag 
cctcggagct 
nnnnnnnnnn 
nntttttatt 
tatcttcaga 
atgtaattgc 
tgagccccct 
gtatatgttt 
tgtaagatgc 
agtggacttg 
attagctgct 
gagcactcct 
gggacgtgtg 
agtgctctct 
tcctgttgga 
ccatttcaaa 
tgttcccttc 
agttactcgg 
aaattccaag 
caaccagcag 
tgttttgctt 
ttagtcggat 
caaaagactg 
atagcagcag 
gtctgaagaa 



aatgacagtg 
tcctttgatg 
cgcacgcaca 
caatgtctgt 
gctgcaataa 
ctccctgagc 
cccacagagt' 
caacttctct 
ttccaatgat 
aaacacggca 
aaaaatcacc 
ctctctaggt 
tcacagggaa 
gagaactgtt 
ggccaagtca 
actgtgctca 
cttgttcact 
gatttcaccc 
tcaacagcga 
tgtaaagaga 
tcaagctatc 
ttaacacata 
aacctggcca 
aagttattct 
cttatcaaaa 
gggtcccgag 
actcttcctc 
aaaatgttcc 
tatgttacag 
aggaaattag 
cgtgacaaga 
gcttatttta 
gatgcctgag 
gtgctactct 
ttaaatgctg 
taaatgatca 
ctggaggcaa 
caatgtaatt 
tattgctaca 
atagcatcaa 
gagccccacc 
nnnnnnnnnn 
aattatgtat 
cacac'cagaa 
tgggaattaa 
cccttttttt 
ctgtatgtag 
ctttggagct 
ggtcctctgg 
cagccacccc 
tggtgcaaag 
atgccagcga 
gctctgagtg 
tcacatagtg 
tacagtgact 
tttcgtcaca 
tggaaaagga 
aaaaggagta 
ttgaacaagg 
tggttagttg 
tggatttggg 
tttatatgtc 
caagcttagt 
ctgatgaaca 



ctggatgaga 
gccatcacgg 
catcactgac 
ggactctaag 
ccgatcatct 
taaagcccca 
tgtggtcacc 
gttgttcgac 
gtctgaaaca 
gggacgaaag 
actctttgcc 
caccgatccc 
ggacagaagt 
tgttagcgat 
gggcaagcta 
tggctctcac 
tcaataaggg 
tgcagaagca 
gagcaggcca 
ccctcacagg 
agctctggta 
aaacgaggga 
ccctagttta 
ccagcctttg 
catttttaga 
tgatgggctg 
acgcccgcaa 
aacaagtcat 
atggataaga 
tttctgatac 
ggaagaagag 
cgtgtgtggg 
gtgaccagga 
gttggtgctg 
agcagtctct 
gaagccctcc 
acccgccctg 
agagcgctgc 
gacgcgtccc 
aggtctccaa 
cgnnnnnnnn 
nnnnnnnnnn 
agatatagat 
gagggcatca 
actcaggacc 
tttttttttt 
ctatgttcac 
ggacttaaac 
accgccttgc 
ctcagtcctc 
attctgtgtg 
tgattcattg 
ggtctccctg 
catagagccc 
tcagagtctt 
aagacctgag 
caatgatgct 
cacagatgct 
cggaagatag 
ggttggttgg 
aaaaatgtag 
ctgaattttg 
ggccctttgc 
ttttagctta 
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tgctcctatg 
aatgcctctg 
caggtgaatc 
gatatgtagt 
aaaaagaaag 
agaaagaaga 
tgttaacata 
tttaaattat 
tgctggccac 
cagatgtgag 
ctcttcgcct 
gtcgttcttc 
gaactgcatg 
tagagtcaca 
cagtgttctg 
ctgagtcatc 
caccacgctg 
ctgagtcatc 
ctgtgttatt 
cagagatctg 
attaaaagaa 
gggaagaagg 
acgtgagctc 
gctccacgct 
ggttcagagt 
actttccgaa 
tgtaatggtg 
cccatggaaa 
ctcaggctca 
ctgcagggaa 
catacacata 
catcagtcta 
atggagaggg 
tgcaatttta 
gtaaaagtaa 
tcataaccag 
tgtgagaatg 
ccactctttc 
caagtaattt 
agatctaata 
aatattttta 
gggtcaagcc 
tgaaagaatg 
tttgttagta 
aagagttgca 
cagccccata 
aacagcctag 
tgactttggc 
atccttgggt 
caagtgggcc 
tgagttcagt 
tctgacctca 
cactcgggag 
atttacagtt 
tcagtggtag 
agagagagag 
aaggaaggaa 
gaaagaaaga 
gaaagacaaa 
gagggcatag 
cctagcaccc 
actactgtgt 
agtaatctgt 
ctagcctaga 



ttagtaaata 
gctgagtttg 
tctgagttca 
aagactctgt 
aaagaaagaa 
aaatgagtgg 
ctactattaa 
tattttatat 
cacacataca 
acacgcctct 
ctacaatcaa 
ctggccagtt 
tgcacttggt 
ggttatgtat 
gatttaactg 
tctccagccc 
gatttaactg 
tctccagccc 
ctaactcttg 
tatgggctta 
tagagtgagc 
tcaggagttg 
tatgagaccc 
accacttgcc 
attgtgcaca 
aaatattaaa 
taagaaagta 
gttgcttaat 
cgagtgctta 
tccaaaaccc 
attttaaaaa 
ggctacacaa 
agatgcactt 
tagagagctg 
taaatgttac 
tgaagtaccg 
tgtttttctt 
ccttcacgat 
tccatagcta 
ttggcataat 
tgtacagcca 
tctgagtgac 
tgggaatgct 
ttatacattt 
aaaatatatg 
atctaccttg 
tatggtggcg 
tccgaaagcc 
acgtcactca 
aatgagatgg 
cttcagattc 
gtacacatgt 
gcagaggcag 
atataaagaa 
agcgcttgcc 
agagagggag 
ggaaggaagg 
aagagagaaa 
gcaaagcaaa 
atcagtgtta 
taccgcaaaa 
actttcacta 
tcttggtttt 
acacacagag 



gctaggtttt 
gtggcacttg 
aggcagcctg 
ctcaataaag 
agaaagaaag 
atttgccata 
caacaacaaa 
gtatggtgtg 
taggcagtga 
cctaggccta 
gctctcccaa 
gagcgcgcac 
acccagggag 
gggttctagg 
ctgagtcatc 
caccaagctg 
ctgagtcatc 
caccacgctg 
cttccttgga 
aaacaagaat 
ctaggagttt 
aaggctagtc 
tgtctaggag 
taacacgcat 
ccctgtatct 
gctacatcat 
tttggcttca 
gaaaacaggt 
ctgcccttgc 
tcttttggcc 
taaaataaat 
gaccttgtct 
cctattccat 
ctattttctt 
tttggaaaat 
ttaacatttc 
ttacaaaatt 
ttattgtacc 
actctgtgtt 
atttggcatt 
gtgttttgtt 
gctgaattgt 
catttcttaa 
atttaggggg 
gtgtggtttc 
ttaaatatac 
ttaacctcgg 
ttcctggaat 
tgtgcacaca 
ctccatgtat 
tgcaagataa 
catggtacgt 
gaggatctct 
actctgtttt 
tagcaagcgc 
agggaaaggg 
aaggaaggaa 
gaaagaaagg 
actaaataaa 
taatgcttac 
tatttgctct 
gcaacataat 
atttttcttt 
aactctcttt 



tagctagcca 
cctttagttc 
gtctacataa 
taaacagcca 
aaagaaagaa 
gcctagtcaa 
tactttacac 
ggaagctgtt 
aggttctttt 
tgtaagcagc 
taaacgtgtg 
aagagtatgg 
cccagaagag 
aatcaaaccc 
tctccagccc 
gatttaactg 
tctccagccc 
gatttggaag 
accctgagaa 
atgtccaact 
ggcacactcc 
agtcttagct 
aggagaaaga 
gacgcccaag 
ttaaaagtca 
ccactctaca 
gttttatggc 
aaaaataaga 
agagaccatg 
tctgcaggca 
cttttaaaat 
caagaagaaa 
gaagctgcta 
tcttttaaag 
atagagaagt 
tgcttctatc 
gggataatgc 
cattttctca 
ctgttacaga 
atgatgctat 
tgggatgaat 
tctcaggaaa 
gcaacactgg 
ataggaaata 
ttttcctgtc 
tgtatataag 
gttacttgag 
tccagctcca 
tacacatttg 
aaaggcagtt 
caggagagga 
gtgtagtatg 
gagttttagg 
gaaaaacaaa 
aaggccctgg 
agagggagag 
gaaagaaaga 
gaaagaaaga 
atacatacaa 
ctagcatgcg 
gtcttgctaa 
gagaatgttc 
gccaaattga 
atgaactcaa 



ccttgctagc 
cagcactcag 
caagttccag 
gaatgactta 
agaaagaaag 
actagatttt 
taaccatgac 
gcagagtggc 
gccaagacaa 
accagttctg 
cagaaggatc 
gttttcacct 
ggcatcatat 
atgtcctctg 
caccatgatg 
ctgagtcatc 
caccacgctg 
cagaactgag 
aattccttct 
cttgatctct 
tatgttccaa 
ttgtgacaag 
aaggggtaaa 
gtttccgttc 
atagtcaacc 
aacttgtaaa 
ctttgcatcc 
caggagggac 
"gcaggcagtg 
accacattaa 
gagctctaga 
gaaatgaagg 
ttttggtgat 
taatgcttgt 
ataaagagta 
cggccagcca 
tgcacttact 
cagtattaaa 
gaagaaatgt 
aataagcatc 
tagcacaaga 
ggactagtta 
ttattattac 
tgctcatata 
ccctagttta 
gcatatgtag 
aactgctctt 
agggatgcag 
gtttttaatc 
atgcaaaggt 
ccaacccctg 
cacatgcaca 
ccagcctggt 
acaggggttg 
gttcagtcct 
ggaggagagg 
aagaaagaaa 
aaggggaagg 
tatattaaat 
taaaactctt 
attatattgc 
actatcccac 
tgggtgaaca 
gtttcttatc 



ttacaactag 
gaggcagaag 
cctggccagg 
aatattgcta 
aaagaaagaa 
cttggttata 
aagctatatt 
agttggctac 
gttaaccaat 
ggctcagggt 
ctgttgcagc 
acatttatgt 
cccctggaac 
gaagaacagc 
gatttaactg 
tctccagccc 
gatttaactg 
gctttgaaca 
attggctttt 
gattttatat 
gcttcaggaa 
tttgaggcta 
gtgaggtgtg 
ccagccctgg 
cttggaatta 
agcccttctt 
taaatgttac 
tgcaaagatg 
ggtctcccta 
cacatgcaca 
acgagtttga 
tttgctgtgg 
tatggtactt 
tgttttgact 
aaaaaaaaag 
gagtttttcc 
gttttgtagc 
ttttcagctc 
acttaattta 
tttctgtata 
gtaaagtgtg 
acatccattc 
atactattat 
gttcaatttc 
gtctcacttc 
gtgaataaaa 
gcagagaaca 
tgcctctggc 
ttaggaactc 
ctggatgaca 
cgagttgtcc 
gaagtcccag 
ctacaaaacg 
gggatttagc 
caactctgga 
gaaaggaagg 
gggaaagaaa 
aagaaagaaa 
tttaagactt 
ggcttctaaa 
tagttgtcag 
tcctctgtca 
agtatttcag 
cttttatgat 



140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
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141180 
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ctccagaggt 
tttggttggt 
ttttgttttt 
agaaactggc 
cacgtgccac 
ccagcattac 
ttaagagttt 
ccagaggtcc 
gatctgatgc 
taaataattc 
ggatcataaa 
cactgctgta 
cagaaggttc 
ttatctgaga 
tctcattcaa 
ctcttccatc 
gcagtggaaa 
acactgacat 
cacaaacaat 
ttccaatagg 
gattccaaaa 
caaagacttg 
acatagcagc 
cctctacagg 
aagatattaa 
gtcatcatgg 
agctgactgt 
gcattatatc 
tagtgcacgc 
gagtttacag 
taaacaaatc 
aggtccttaa 
ctggagttaa 
agaagaacaa 
actgaggcat 
gagctgctac 
tttttaaaaa 
aaaaactgct 
taaattttgt 
tcattttaaa 
tcatttttcc 
ggtttgatta 
gactgtagat 
ccctatattg 
gagttccttt 
gtcatctctc 
tcacacctag 
aatctcacta 
actcacagag 
aagcccagca 
ctattgagcc 
gacaggccta 
agggttaaag 
taataagaca 
cactcaggag 
aatgggaggg 
tgtaatttaa 
tcatggtttt 
tccttttgtt 
gaaagactgc 
ctccaggagg 
ctggtgcttt 
caagaaaagg 
atagccctgg 



ttgtttttgt 
tggttggttt 
aatttaatgg 
tttgaacttc 
cacatctaac 
tttgactctt 
gatccagggc 
tgagttcaat 
cctcttctgg 
tttaaaaaaa 
ttcaaggtga 
ttcatttcct 
agtattcttt 
tgcagagtcc 
ggcctaacac 
tgggtcagta 
atgagctcct 
ccacccagac 
atgtagttga 
ccagcacttc 
tgtgtccaga 
aaagatggct 
tcacaatagg 
cagaaagcac 
ataatagccc 
gcagtgtaaa 
gagtggcttt 
cgtggatttt 
ctttaatccc 
agagtgtcta 
agatctgtat 
ggacaggaaa 
ggactgttgg 
gaatctatct 
gctatgtgtt 
taaaacctaa 
aagagtgggc 
gctgctgctg 
tgtatacttt 
ttggcactgt 
atttatacag 
cagaaactat 
ccaaaggaca 
atcatctctt 
catcacttta 
tttcagatga 
cctcttactc 
tgtagttctg 
atctgatagc 
aaatttacct 
agtcatggtg 
tgggttcggg 
ttttgagaca 
aatcagacat 
gtgagatcaa 
gaggaggaag 
ggcctaagct 
ctgtcttcgg 
ttcaggcatg 
agcccaagat 
caagacagac 
cctcttacca 
agactctccc 
gggaagagaa 



gggtttatta 
tgctttactt 
actggttcca 
tggtcttcct 
aatatctggg 
catagtttct 
tggagagatg 
tcccagcaac 
tgtgtctgaa 
aagagtttga 
gcctgggtga 
cattgatctt 
ttttcccaat 
cctttattcc 
tgaggacatt 
tgaagtgagt 
ctctgttctt 
ccattctctc 
caagaagtag 
tgtgtccata 
tattacctcc 
tgatggctct 
ctataacgtc 
agacatacat 
acgcttgagc 
agtaaaggtg 
cttctcatcc 
aagttaggaa 
agcactcagg 
ggcagccagg 
ttcaggaaga 
gtattccaca 
aatagttgaa 
gggcatggtg 
tgtggctaac 
aaaacaaaaa 
agaatgtctc 
ctgctgcata 
agctgtgctg 
acagaaatta 
aagcaatgta 
caaagtattc 
agtggagaaa 
ttacttctaa 
gatatataat 
gaaacaaaga 
actccatgaa 
gttgtcctag 
ctctgcctgc 
tcttaatttt 
agacaggctt 
gcagcctgat 
aaactttgtg 
aatattataa 
tccccagcat 
gaagagagaa 
ctcgactgaa 
tacatagtga 
tcttaaagac 
gatggtgcgt 
catcagtgat 
gattctgcaa 
agaacccatc 
cacacagatt 



gtgttttttg 
tttcttattc 
tgtggccaag 
tcatctacct 
tttctttatt 
tccagagtta 
gctcagtggc 
cacatggtgg 
gacagctaca 
tccatttaca 
actggcagaa 
cagaggtttg 
tctattcagt 
tgtgaccatc 
tcactgtgct 
attgaagaac 
ctgcacacat 
ttctgactca 
ctatgtcatt 
tcagctagat 
ttcgtttgtt 
tccagaggct 
atttccaagg 
gcaggcaaaa 
ttattcctct 
ccccgccctg 
cctaactgct 
atgacaaaga 
aagcagaagt 
gctatataga 
tgcctatgag 
tgtgtgcatt 
tgttcacact 
atgcacaact 
ctgggctaca 
tctaataatc 
tgattttgtt 
gcacacagct 
tcaactttgg 
acagccatat 
ctgtattaaa 
tctaaatgat 
aacttaggaa 
taatagtggc 
ttatctcatc 
cattgaaaaa 
tattcctttt 
aactcaatat 
cgagtgctag 
ctaagacgtt 
aatctttaat 
ctatagagag 
tcaaaaacaa 
acaagtatta 
aatgagggga 
aaaggaaaga 
gttgtccact 
agagtggaaa 
atagtggaag 
aagacagaat 
ctcgaaagca 
aggaatgtcc 
cctgcagctc 
tctggggcct 



tttggttggt tgtttgtttg 
atttttttat ttctttattt 



gatagcctca 
cccaagtgat 
ggagtttgaa 
ttacactttc 
taagagcacc 
ctcacaatcg 
gtgtaatcat 
ctggacttct 
gtggcagaag 
ctaacgggaa 
ctttagtagt 
tcgttgtttt 
atgccaatcg 
ctggtgtcct 
tgaaatcaac 
tgtcacctct 
gtccacagtg 
gtgctgctga 
tcttctaaat 
gggtttgatt 
ggtctgactt 
cacataaaca 
gatgatacag 
cccaggggca 
gcaacatcat 
ttagatctat 
aggtggatct 
gaaattctat 
ctgtttgaca 
ttacagaaat 
cagtgttctt 
gtattcctaa 
tagtcaatat 
tgggatttta 
cagatggcca 
aatatttgac 
aaaaaaagta 
tagtctagac 
tattcagtct 
gaaaaaagat 
gaattttccc 
catttattga 
ctaaaatgac 
tctaacttgc 
ttttttttcc 
atagaccagg 
ggttaaatgg 
tctcctctta 
ctcagcactt 
agttctatgg 
acaaacaaag 
gtgtcactca 
ggaggggaga 
tctcaaagca 
tttaatgacc 
ccaagggtca 
caggtatgca 
gcctggtgaa 
gcctcactgc 
agcatctgaa 
tcaccaatgc 
ccctctctgt 



actttgtagc 
gggattaagg 
agggattcct 
atttgttaca 
aactgctctt 
tctgtaatgg 
ataaataaaa 
tgaggcagca 
ctgtgtctca 
gtaagtggaa 
agatccctca 
tcagggagtg 
ctctgcagcg 
gcctgtggct 
tagcttgcaa 
cataggtgac 
catggatttg 
tagtatttta 
aagccaggca 
cccagcccca 
cctgttctgg 
taattgaaag 
ctctcctgaa 
tcagtgaggt 
caactgtgga 
ggccaggcac 
ctgtcagttt 
ctggaaagaa 
tgtgtgatag 
tggttataca 
caaatcaaat 
catgtagaag 
tggacagtca 
tattttcctt 
cagaacctag 
tatacgtata 
tcccagttta 
acattaaact 
tatctacagg 
tcaagaatct 
tttatcccct 
acatacccag 
ctgttgatga 
ctgcataaga 
tttgagacag 
ctagcctcaa 
atgtgtcacc 
aaaaatggaa 
aggaggcaaa 
gttaaagttt 
ccagactgct 
attaaaaagt 
gaaggaaatg 
gaacacagga 
cttttcatgc 
tcctggaatt 
tgggtatcct 
cctggggaga 
caagccaacc 
aataaagatc 
cttcgaggat 
cagcttatgc 



144420 
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146460 
146520 
146580 
146640 
146700 
146760 
146820 
146880 
146940 
147000 
147060 
147120 
147180 
147240 
147300 
147360 
147420 
147480 
147540 
147600 
147660 
147720 
147780 
147840 
147900 
147960 
148020 
148080 
148140 
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acacaccctt 
ctggtgaagg 
cattgccttg 
tgtctgaagc 
agccaggaat 
gggctgctcc 
agatttgaag 
acctgtttac 
agtgaagaga 
tatggaagta 
gaacacagat 
agcgtgcagg 
gcagatttga 
gcccaggcaa 
attggtattt 
aataaatcag 
agctttgagc 
tctaagacag 
tctcttaaaa 
caagaggcaa 
caggctacat 
caatagcaac 
gcttttttca 
catgtatgtg 
gccactgagc 
atttattcat 
ctaccatgtg 
tcctgaacca 
tattatatgt 
tagagatgat 
gcagtcagtg 
tggtttccat 
gaacttggtg 
aatgatggtg 
catactacac 
tggagagtga 
aagaaaagtc 
tttctccccc 
gccggaggat 
aggacgacat 
atgttttgcc 
gggcatcaga 
ggaatcaaac 
tttattacat 
cccccccccc 
tctagttaat 
cagattaata 
tacactgtcc 
tgccaattta 
agcagtagga 
agttagtctc 
cctctcaggg 
gtgagtggaa 
atgggtgaga 
ctctgacaat 
ttattggcag 
ggacagaggt 
cccaggaatc 
tttcaccaag 
gagaaggaga 
tcaagacact 
tctgccttca 
tggctcggct 
ggtttggttt 



tgaaaggccg 
aagatgcttt 
agaacaagtg 
acactttggc 
gtatgcagca 
tcacactcgg 
atgttgatag 
ctgaaagcaa 
agagcaagat 
attataatgc 
tcatctgtgc 
cccaaacctt 
ctgtttgggg 
taacatccat 
caagagctaa 
tgacctaaaa 
acatttctgt 
ctttagctat 
acttgagcca 
aggcaggcag 
agtaagatcc 
aaattaattt 
gacacaccag 
gttgctggga 
catctctcca 
tttgtgtgtg 
tgtcttggag 
tctccctagc 
aagtacactg 
tgtgagtcac 
ctcttaactg 
tttatccaaa 
tgtgagcatg 
gcctgcctga 
actagctgag 
tcatctccac 
cttttaaggt 
tccctcattg 
ctttgtgaac 
agatataccc 
tgcatgtatg 
tctctcagaa 
caaggtcttc 
attttcctca 
ccccgagcag 
taagctgaaa 
ctctgttagc 
agattttgcc 
tacagtgttt 
tttgaaagct 
tgtggtacac 
tgtccagaag 
ttctgcagca 
gggccttctc 
cttagcctgt 
taaatcagag 
cattggctga 
tcaggtgctt 
gttatgtatg 
agccctgctg 
gatagacttt 
ttggctcttt 
agtaaaggtg 
ttgttttttt 



ccatctagtt 
tgtaagtgtc 
ggataccaga 
cacagtacct 
ggcatgggac 
ggtcttctgc 
agttgtaagg 
gaatctggtc 
aggtgaaccc 
tatcttcagc 
catcagtgtc 
aaagtcccct 
agcagccatg 
atgtcacaca 
tgtttaagga 
ggaaaacaca 
gaacccagag 
acagcacgtt 
gagccaggtg 
atccctgaat 
tgtttcaaat 
tttagatgta 
aagagggcat 
attgaactca 
gtccattaat 
gtatacatac 
atcaaactca 
catttctagt 
tagctgcctt 
catgtggttg 
ctgagctatc 
gatgccttta 
aagaccagag 
gattccagga 
ctgtgtgctc 
aggcaagcac 
ggtggtggtg 
tgatggcaca 
tggaggtcag 
tgtctcaaac 
tctatgcaca 
ctggaatttc 
tggaagagca 
attacatttc 
caagtattct 
gggagggagg 
ctaactacac 
atcctttttg 
ataggtattc 
gacatttatg 
attttggggt 
ctgctgcaca 
tcccggtcca 
caagggcctt 
gcatagtttt 
gttatccact 
aggataacgt 
gctgactggg 
gcagagggta 
ataagggaag 
gtccttaatt 
gagccactgc 
tttgcctcta 
gagacaaggt 



gccacaaaag 
cttatcccag 
gttaccagtg 
tgtggcagct 
ctctaccata 
catgttgccc 
atgctttgtg 
aaggcatttg 
ccaattcctg 
cttttgcccc 
tccgcactgt 
aagctgaaag 
ctgcgacaca 
ggtaagtcag 
gaaaaacact 
gccgtcttat 
cttgggaggt 
tgaggtcagc 
gtggtggcac 
ccagcctcgt 
aaataaatat 
tttatttatt 
tggatcccat 
acacctctgg 
taaaaattta 
tataatacag 
ggttcttagg 
attcttttct 
caaataccgg 
ctgggatttg 
tccagcccca 
agggcctggg 
ttcagatccc 
caacggagac 
aagagagcag 
acacctgagc 
ttgttgtttg 
ttcctttaat 
cctgttctac 
agacaaaaat 
ttatgtctgg 
agacacttat 
gcaagtagtc 
caatgctatc 
tcattgctgg 
tagatgttgc 
tatagaagct 
ngtgtatatg 
tttgtgacgt 
tggcctatgg 
ctatctgcgg 
ctgggctgga 
gctgggagtg 
cttagtgtta 
ttattcgaga 
cttagggaag 
actgagatgc 
tttcttaggg 
taggggtttc 
tcccccattt 
agcagggccc 
tacaaaatat 
agcctgaagg 
ttctcagcat 



acattctccc 
gagaaatgcc 
gagacctcta 
gcagctatgc 
gcacagagtg 
tgagaatgac 
acgaggttgg 
attacacaag 
gatgcaatgt 
atactgaaaa 
gcagacaact 
cagtgacccc 
tgctacagct 
aaatggtttt 
ataaaggaag 
atatcattat 
ggagatagga 
ctgaactaca 
atgcatttaa 
ctatatagtg 
aacaaaaaca 
ttatgtatga 
tacagatggt 
aagagcagtc 
aaactagagt 
gttcataagt 
catgggagca 
" tttgtcttga 
aaaggggaat 
aactcaggcc 
tttttagtat 
aagatggctc 
tagcacccag 
agacaggggc 
ccctggctta 
acacagacat 
ggggtctttt 
ctcacatctg 
atagcaagcc 
tatttatttt 
tgcccatgaa 
caagtactgc 
tttttttttt 
ccaaaagtcc 
gccatctccc 
ccaaacttag 
tattctttag 
tctacagatc 
ggatctttta 
tcctgttaaa 
ttccgcatct 
aggatgaagt 
aatgctgggg 
cagctctagg 
tgagcttgta 
gagataggaa 
tcattagcac 
ggttgagagg 
aggctcccca 
tgagccactt 
aacaagtgtc 
ccaaatctgg 
cctgagtttt 
agccctgggt 



aggagaactg 
acgacctcat 
ctgtcaccga 
caagtattgc 
ttctcttggg 
tcttttagcc 
tagcacagac 
tcagggagag 
caatggaaag 
gcatagccca 
caaagctgac 
agggctgtgt 
gcagtgtaat 
tacttacatt 
cctggcatca 
gctattgaga 
tgattaggag 
tgagaacttg 
ttctagtact 
agatccccac 
gcaataataa 
gtacaccatt 
tgtgagccac 
ggtgctctta 
atttttaaac 
caattctctt 
agtatttact 
aagatttatt 
caggtcttgt 
ctccagaaga 
tcttattaag 
agtgggcatt 
gcagatgctg 
cctagctaac 
ctgtgcagga 
gcacaaagga 
gttttgtttt 
ggacaaagag 
catttcagcc 
atatatttga 
agccagaaga 
ctgagtgcta 
ttaatatttt 
cccataccct 
catctccttt 
gatttattga 
actttcacat 
ttaattcagc 
cccatcttaa 
tcacatttca 
cacacttttc 
ggagtccaga 
tcaggaggag 
caaaggcctt 
tgcatacact 
tacccaaggt 
ggggaggcat 
gtagcagtga 
gacaaagaag 
cagaaggcta 
tgttttcttt 
gccaggaagc 
cacttgattt 
attctggaac 



148260 
148320 
148380 
148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
151860 
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151980 
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tcactctgta 
gcttagatta 
gatggctcag 
gcaaccacat 
ctgaagacag 
aagaacacta 
ttcattctag 
gaaaggccac 
ttctacctgt 
gatggttgtt 
gccaccacat 
agttacccat 
gcacatattt 
agcccagttt 
aaaaacaaac 
ttgggcgtgt 
ttgggaggca 
gaaaccctgt 
gttaatatga 
ggagacattc 
cnnnnnnnnn 
nnnnnnnnnn 
aaccgtcagg 
attaaagcac 
gagaggcaca 
ctgggagaag 
agcagctatt 
tctgcagggg 
ttgtgtctat 
gtgtgtgtgt 
gtcagtgctc 
ggccaggctc 
taggaaataa 
aattggattc 
tatgcagagc 
gactgggctt 
tgcccgtcct 
gctactgaaa 
cagggccctc 
ctcctgaggc 
ccatccaacc 
agaaacctct 
gctaatgtct 
tgtgttgcat 
tgattcttgc 
gcccagggac 
tttttttctt 
tagtttataa 
tttatatatt 
taagtggaca 
aaatgtcatc 
ccgggggctg 
gtggcatagg 
tctgagattg 
tccaaaacca 
agcttcttca 
cactgtgctg 
atacccttcc 
cggtggctga 
aaaaaaagaa 
agtgctgggt 
ggccaggagt 
acggatgggg 
gtcaaagctg 



gatcaggctc 
aagttgtgcg 
tgggtaagag 
ggtggctcac 
ctacgggtgt 
ttcacaaggt 
gggaaaaatc 
agcacgtgta 
ttttgaggca 
ctatttctgc 
aagcatggca 
caagccatct 
aatcccagca 
acaaatcaag 
atccttctgt 
caaaaatcac 
gaggcaggag 
ctcaaaaaaa 
aatagcagaa 
catactgaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctcagaggca 
caaatcttgt 
catagcttcc 
cttgaagctc 
cagcagctgc 
aaagacaagg 
gtgggtgagg 
gtgtgacaga 
cctttggctc 
gcccacccca 
aaaacagggt 
agcatttctt 
acaagagact 
gcctgcaggc 
ggggtcatcc 
caccctcatc 
agtaggtcac 
cagagctcag 
cattggacta 
tccccagctc 
gcagcttcta 
ataaaaacga 
tgcttctctt 
tattgttgct 
gctagaagtt 
acatgataag 
tatatatata 
taggtataaa 
cactggcgat 
cctgagacgt 
gttggacagg 
ctggttgccc 
gcagggtcct 
aggggcgcat 
gagtggatac 
cgtctacggc 
aggcttcagc 
agaaagaaag 
gacggagccc 
ggggccccaa 
gtcctgcgag 
aacttgtgct 



aacttgaatt 
ccaacactgc 
cacccgactg 
aaccacctgt 
acttacatat 
acaacaacag 
tttttaaact 
tccaggttag 
gtctcttgtt 
ctctcatgtt 
ccagcactga 
tgctagcccc 
ctcaggaggc 
tcccagaata 
gttccagtca 
aagtagcaag 
gatctctgtg 
ccaagcaaaa 
ctcaagatag 
tatatgcaga 
nnnnnnnnnn 
nnnnnnnnnn 
ggatggcagt 
actaaaaaca 
gggcccttgc 
aagaactggc 
ggctcttggg 
agatggcact 
gaagtgccgt 
gacagaaagg 
ctcctgccaa 
ccaaagccca 
tcttgtccaa 
actcctcagt 
gtgcttgcca 
tgccttctgg 
aggaggagca 
tgctaaggcc 
tggccaaaga 
agcctccctg 
acactgtaga 
cgcctagctt 
cagtaggggt 
ggtgatgttc 
ccttcccgac 
ggctggggtt 
accgatacag 
acatacagtt 
tatatattat 
actgcacctt 
ctctcctttg 
caggaatgag 
gcaggttaac 
cagctggctc 
tcagcccttc 
gtcccgacac 
ggtccagagg 
tgtacatttt 
agccttttcc 
aaaagaagaa 
tgggtcctca 
gggccgtgcc 
cacacttgcc 
gatagtcggg 



cagagatctg 
ccacctaaaa 
ctcttccgaa 
gatgagatct 
aataataaat 
tgtactagga 
ttagtgtgta 
gagtcaactt 
tctgccctac 
gccctatgca 
gccaagggca 
agaaaatgta 
agaggcaggc 
gccaaggcta 
caatgactgc 
gtgtaatggg 
agtttagcac 
atagaattac 
ctaatgaaac 
gcctgatgct 
nnnnnnnnnn 
nnnnnnnnnn 
gatgtgtttt 
gccctctgct 
cctctgctct 
cgctgttgcc 
caagcttgga 
gagctgggta 
gtgtgtgtgt 
gagagtgcat 
aaagcatgct 
ggatctgcca 
gagaattttt 
atcctctctg 
gaaacccttg 
gctgacccct 
ggaccactgt 
actagatttt 
gcctagatgt 
gcctgcacaa 
cgctgccttg 
gcctccagca 
gacggggtgc 
taagtatcta 
tcttcccact 
ctgaacaaaa 
tccttaattg 
tggtaaggag 
gtgggtgtgt 
ctgtgtgact 
gtgattggtt 
aggcattacc 
taagtgatct 
tgggagagcc 
tctccaaatg 
cactaatgac 
cgctcggtca 
gggcttataa 
tgaaacccag 
aaaaattctg 
gcggagagtg 
tgaggatgcc 
cttcctcttg 
ggcatagtcc 



cctgcttcta 
aaaatatgag 
ggtccgaagt 
gatgccctct 
aaatcttaaa 
aactttaaaa 
agaaagagag 
ttcagaagcg 
actttgtaca 
tgctgagctt 
ggcatctggc 
tttttgacag 
agatctctgt 
catagagaaa 
tgtaacaata 
caggccttta 
agccagggct 
aagttaacca 
aaaggattct 
tgcctagcaa 
nnnnnnnnnn 
ngcctgggca 
ggggcacaag 
ctcgatctgc 
cccggtcagt 
tttgtgctga 
agtagctcgt 
gccaggatgt 
gtgtgtgtgt 
gttcttgtgt 
gtgttctcag 
ccccattaag 
attattattt 
gtagggaatt 
ggccaacagc 
gagtctggcc 
gatacagggt 
tttcccactg 
taaaactgca 
agtgctaaga 
tctgccagct 
ccctgacatc 
tccggggtgc 
agaatgttgg 
gacacctttg 
ttgccgcagt 
agcaaatata 
tgggttgggg 
gggcagagtg 
ctcattgcga 
cttggacccc 
cccatggata 
cagacaagag 
ttgttctgag 
accaggcttc 
tcactttgcg 
ggacagccga 
tccaccagga 
cagatcttcc 
ttccttgctg 
actgccagcc 
tgctgcaccc 
ggtcgtgcgg 
tgcagttcgg 



catcccgagt 
gggctggtga 
tcaaatccca 
tctggtgcat 
aaaaaaaaaa 
tagcccatca 
aggggctgta 
gagtctcccc 
tgaacttcaa 
actgatgcca 
tcacatggag 
gtgtggtggt 
gtctgaagcc 
ccctgttttg 
atatgaggat 
gtctcagcac 
gttacacaga 
gggtattggt 
attttataaa 
ctgaactaca 
nnnnnnnnnn 
agagtaagca 
gccccctttc 
agagatacga 
tcctggactc 
cgccagaacc 
tccttctcct 
gggaagtaaa 
gtgtgtgtgt 
gttcctgtga 
agtcaagctt 
aatgcggggt 
tttctctctt 
caggtctgtc 
cctattccct 
ctctgacctc 
tcctgagatg 
ccagactact 
gctaaagcct 
aggacattgt 
tgaaaacccc 
cacttctctc 
cagacaggcg 
tcccctgaag 
cccccagcaa 
tttttgtttt 
ggttcccgta 
gacctgtgca 
aggatatata 
gtacagttct 
agcaggtctg 
gggactgagg 
ccaagagtgc 
tcctgctcct 
cgcagagccc 
tgcctttgac 
gtgagacgtg 
agggtccaga 
acttaggaaa 
gacatgtggc 
ccagtatcca 
cactgggggc 
tgggcatgat 
taagctcttt 



152100 
152160 
152220 
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cccagagctc accttagaga tctggttccg gttcctgtgg ctggtgcagt tcttgcctgc 155940 

cttcttgtaa cctgacctgg agccaggcgg atggccatgt gggtggcctt tgtccctgga 156000 

ggccccatgg gacggatggt gctccttgcg ggcagccctg tcagaggtgg taagcgtgtg 156060 

agacttgatc tggtgaggag acactggtcc tgtgcagttc cggaagtcct ccaccctcag 156120 

cagcttcaga tcctggcctt gccgcagctc gggggtcgcg caggggacag cagagctaga 156180 

gccacggaac cttcgcagcc attcccacag ggaacgtgcc cggcagccac agtcccaagc 156240 

attcccattg aggcgaagga actccaaggc caccaggggg gccagacagt caccctgcag 156300 

ctcagtgagg ctgttgttga agagaaagag ggtggttagc ctgtggaggt catggaaagc 156360 

cttgtggtga acccactgta gctggttctc atgcagcagc aaccggtcca ggttcaccag 156420 

gccccggaag atgccttggc ccaggctcca tagcttgtta ccatggagaa acaagtgact 156480 

gagattgacc aggtccacaa agatgtcatc ttggaggtac tcgatatggt tgtcctgcaa 156540 

gtagagatac tgcaggctgt gcaggccacc aaagatgcct gcgggcaggg cgctcagtcc 156600 

acacttatag aggtagaggg cgtgaagctt caccaggcct tggaaggtct cgggtgccag 156660 

cgttcgcagc tgtcggttgt ctccaaggtc tagctcctcc agatgcacaa agccctcgaa 156720 

ggtgttggga gcaatgaaag tgatgttgtt ggagtagatc cagagggtga ccatggcggg 156780 

gctgaagtgg ccctgctgga ggaaggtgat gcgattgttc tgcaggaaga tgcgctcact 156840 

gtcctctggg atgccctccg ggatggcagc aaagttgtgt gcctggcagc tgacagtcat 156900 

gggcgcaggg tagcacacac agtctcgagg acaaccacca cccagaggta gctctccagc 156960 

gagcagcaac agcagcaatt ccacacagca ccctggtggg gagagacaga acagcagtga 157020 

ggggctgccc agaggaggtg gagatagatg gggaacagag ggtggaatgg gggactggca 157080 

aatgactctg ttggctcaca gaggttctgt cctctgtatt gtatgcaggg gtcccttgga 157140 

cagggcattt ggggccaagg cccacattat ctcctcacct ctttagctct gtccctaaag 157200 

tctctaattc catccgaaca cttcttcaga ctgtcagccc caccgcaggg tggaacacac 157260 

ttgtgaacac aggcgagtcg gcctccggct ctgggtccgg ctctgccact cgctcactgt 157320 

tagctgcctt agcaaggaat gactctaaca aagcaaatct ggagtcctga atgatcactt 157380 

tatttaaata attcctcaaa ataaagaaag cattgagtcc atggtaccaa agcatgcctc 157440 

aataagcgcc tctttcacac tgtggtacaa aaacttcaaa cctacaactc ctccatggct 157500 

gtttcctcat gagttaaaca cagttcacag ggctgtgtgt * acagacaagg cacatttctg 157560 

tgaggggctg tgagtgacac cagggctgac gcacgaggct tcccttgggg ttcacagtac 157620 

tgccagatcg aggctgcatg ctcctctccc ccattcacac ccccccctcc tgtgctggag 157680 

attgccaggc tgtggctgta aaaccgggcc ttgcctcttg actgtccaga gcatttcctc 15774 0 

tgtagcttcc ctctaattgg gcattaatta ggcattcgtt aatggatcct taaaataatt 157800 

attttcggat gtgtccagcc tgtggtcggg taataggcct atgctcatta tggaagccgc 157860 

ctcattatgg acgattgtca ttacctgcct ttttccaggg tcacagctgc cccaagtggc 157920 

ccagcaggcg cgtcaggaag atggggacag gctccaggcc tacgggcgcc caccctgaag 157980 

ggccaggcag ccacgaccta tgtcgcctca gttggcctct tgccccttct tttccagctt 158040 

gttcagctgg gactcttggg agagccaggg cccctggggg aatatgagct gagctgaatc 158100 

ttcttgctgc tagctgtgct cagagcaagt ggaaggagca gggaccttct gaccaggctt 158160 

cccacttggg gtcccaggcc cagggactgc ccaggccccg gcagagtagg ttgccacctt 158220 

gacttctgac gccccccccc cattcccaac agaaacagca tcgtaagttg acagctccca 158280 

gctgttggga gttatgggct cccagagagt ggcagctgct tctcgtccct gtaatcaccc 158340 

ggcttcagct agaatgtttc tagcacataa aaatcatcgc atataattta gtttttgcat 158400 

aattgggttc agttgtgatt tcagagcaat tatgacctca gcagcagggg tggcacagca 158460 

taggacccct ttctgggccg ggccatgccc tgcaggccct ctgcagtgct ttctgcccac 158520 

cggcccttag acagcatgca ggctataacc atctctgcct caatttcctg ctccaaaacg 158580 

tgtctagatg ttactctgtc gatcttcctc ctcagcatcc tgggtgtggc ctccagcctg 158640 

ccagcctctg tcctagggat cctgtgctgc agagggaggc acagtcggag ggaggggagc 158700 

ctgccctgtg ccaccagcac tcacactggc tgcacagtcc acagacccac agctccaacc 158760 

tccctgcttg gcttgcaccc tctcttccag gaaggccatt cttgccagaa cctttcccaa 158820 

cggtcccctg ggaaagcctg gactctaggt tcaaggacat tcatgatgct tgccccacat 158880 

tttatgctgg atgagacaca gcagagcctt cttcactggg gggtcctgtg aaaatgaaag 158940 

cttttcttcc ccggcctgca gctgcaggca ggtaggggtt gcagtgggct tatcactaat 159000 

accattcgac atttgtacag ctcatcggag tttacagagg gcttttgttg taccctcaac 159060 

ttcccctgtg ttctcccacc tactgtggct gctctgtctc tgtgcacccc aaaagaatct 159120 

ggaagtccct • tggggagatt acccccctta cataggggcc tccaaggaat acaggctaca 159180 

gctctattta ggaaaaaaaa aatcaaactg aaccaaacct caggtgtgga cttagtaacc 159240 

agtttataaa cataccgtgg caagtggagg aggcaggcgg cagaacggca tgaaggtagg 159300 

actggggttt tccctctaaa agggcaatgg ggggacaaag ggactctagc caagacctga 159360 

tgcagaacca ggctcagttc ccctgttatc tcaaggctat catcactagg aggcctaagg 159420 

caatggacca caaggacctt gtccttgtag gacagtcact tcctgcagtg aagtgctctt 159480 

ctggaagcat actaatagga tgtaggctca ggacagctgg tctctgtcct ttagtatttt 159540 

tccacatgcc agggatgtta accttccaag cctccatctc ttctaatggg gggggggtgt 159600 

tgaggggctc agccactctg catccatgtc tttgaaagcc agtggtatta ctccaggacc 159660 

ctgagcaagc tgtcctagtc agccctggcc acttctggac tccttgcctg agtcagtagg 159720 
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tgccaatcct 
atggcgcctt 
attaatgtac 
aggaatccaa 
cttgggcagg 
cttcccctcc 
tcccacgcac 
acttgccccc 
atcatgcccc 
gaaattaaga 
cttggtcact 
tttgaactag 
gctcagacct 
tgggtcagct 
catggtccca 
atgtgcaaag 
agcctccaag 
accagaacca 
aggggccctg 
gtctttaatg 
agcatcagtg 
ccctgccttc 
tcgaaatggg 
agcacccaag 
ggtcagtggg 
ggccttcagt 
tgttaaccga 
ctggggctgc 
ctccttcctg 
ctcagcctcc 
cagggctgtg 
agcctcgttt 
cccacagata 
ccaaaccaag 
taaagatggt 
ctctggggaa 
actcgctatt 
gccctgcctc 
agtcaggaag 
ccccaggtct 
ccgtccctcc 
tacctggacc 
agcgatttgc 
acagcacgac 
tgctccctgc 
gtctctggag 
tcatagtagg 
ggagaacaga 
ccgtttgttg 
aaatagactg 
atgctaaaga 
cagaaatgtc 
aggtgcttgg 
aagtgagttc 
taaaaatctt 
acacacacac 
gggctgaact 
tgtctgtgta 
tggatggaga 
actcacatct 
cccaggccca 
gcccagcatc 
agcaccccag 
cccaggccca 



aggattgtca 
ctatgccccc 
caacaatgag 
gtctctgtgg 
gctgccttct 
aggaggtgcc 
gtgagctccc 
tagtctagga 
atttgcatat 
gaggggccta 
gatcccctct 
ggctgagaag 
cagcacacac 
ggcaagcccc 
tgggaccaga 
gcactggggt 
gaatactgta 
gcatcaacag 
tagcaccctg 
aaaagccctt 
ggctggttgc 
aggctggggc 
ctgggtcagc 
ggggctgccc 
tctctctgaa 
cctccctccc 
atgctctctt 
tggcgcctca 
gctcggccct 
actgctgtgc 
gagccagagg 
tcctcacctg 
ctggtaccca 
ctcttctctt 
ccctttacat 
attctgctgg 
gctaccattc 
ctcaccccta 
ggctgctcct 
ttctcccagc 
cacatgccag 
ggttcccagt 
aaggagcttc 
ataagtcact 
acacagtagg 
aattgaatga 
tgctcgataa 
cacctgaact 
gatgagcact 
ggactgggga 
aaaaaaaaag 
tgggtttatg 
gaggcaaaga 
taggacagcc 
agtgggggag 
acgttagtat 
cagagtgtta 
gatgtgtacc 
catggttgtg 
acctctggag 
gcaccccgac 
ccacgcccag 
acccagcacc 
gcaccccagg 



ccagcaggtt 
tctatgaggc 
gatggagcct 
aggggccctg 
acttgagggg 
cttcatcgtg 
aaggggctct 
gagccatgga 
gcggtggcaa 
aggaaagcct 
gtaacaccag 
ggtctgctct 
tcaacagaag 
aaactcttcc 
cctggtagga 
gggaaagggc 
acctcagctc 
ggcctgtctg 
gccaccagaa 
ccggaagcca 
agctggagaa 
tctagccagc 
tttttcaggc 
ctctacccct 
gctttgtagg 
tgcccccagc 
gctgtgggga 
ggccacttgg 
gaagacaaga 
agagcggccg 
cctaagttga 
caaaatgggg 
cctgcccttc 
cctctccggc 
cagctgcctg 
ctagatttta 
acccagtagc 
cacctccttc 
tagtctgcat 
ccagaggccc 
ccctcagcct 
gtgtctgcag 
tttcctggga 
tgtttccttt 
tgctcattaa 
tgacactcag 
atgttggttg 
tggctcacat 
ctgagaagga 
gtttcccaag 
ttagtcactc 
aagagaggaa 
cagatggagc 
aaggctacat 
cagtcaagga 
aatatcatac 
gtgtgtgcta 
tgaggtgttt 
tttgctaggc 
actgagagtg 
ccagcatccc 
catcccaggc 
ccaggcccag 
tccaagcact 



tcttcctagg 

ccttgggagc 

ttgccatgca 

cacctcttct 

gcggaaggga 

ttctgcttcg 

acagcctccg 

agacagtgtg 

taccctggtg 

ctgctatccc 

cccttctgca 

gttcagctgc 

gcatgcaagc 

tgccaagctg 

taggtggcaa 

ctgtgcttct 

agctgggctc 

ctaaacccag 

ctaacgagga 

catttgcaca 

cacagcaggg 

tccctgatgc 

tccaaagggt 

aagtagaggc 

ctgcttcttg 

ctggccagct 

tggcggcctc 

cacttgtgcc" 

aagctggcca 

gcagcctcct 

atcctggctg 

cagtcattgg 

ccacatctcc 

ctcctccgtt 

ggaggaacac 

gaatgtatcc 

ctcctggatg 

aacaggaact 

cctccagagt 

tgcctctccc 

ccctcctgcc 

ggctgcttcc 

cattttactc 

atgtccgctg 

cagttctggg 

gaagccctag 

taattagtcc 

caagactcct 

cccaatacca 

gtatgggatt 

taaaaggggc 

gccagatatg 

tctgtgagtt 

agagaaaccc 

ggaaacacac 

tatggctctg 

gtggatattt 

atgtgtacac 

atttgcatgt 

acaacccagg 

aggcccagca 

ccagcacccc 

catcccaggc 

caatgcccag 



gaggcaagca 
cccgccccac 
ttttaacatt 
gccagactca 
gaagacccag 
ttactctcaa 
tcattccttc 
ggaagggctt 
ggtaccagga 
tgggctacca 
acctgccaga 
cttggctggg 
aagggagcta 
agcatgaaaa 
ggctaaggca 
caccccctct 
gggtggccag 
acctcacaag 
agatctgacg 
gaagaaaatg 
ggacaggtcc 
ctggagtagg 
caggacagct 
atccccatgg 
gccatgtagc 
gctcttctct 
aggccaggcc 
acttgtgttc 
gggaacagct 
atccatggct 
ctccttttaa 
aagctcagcc 
atctgtctaa 
cacaacttct 
ctcagaatca 
agtatctatc 
ccctcccccc 
agggtagtcc 
gcccatctaa 
tggctacaca 
tttgtccatg 
cagagaagcc 
tacagcacgc 
ccact"€tgtt 
ggagggaatt 
gctcaagcct 
tgggagactc 
taggcccatc 
gtcctttcct 
ttcagcacta 
caggaccaaa 
gtggggcaca 
tgaggccagc 
acttgacttg 
acacacacac 
tgcctgcagt 
gagctctgta 
aggtgttggc 
gtctgagttc 
gcccttttat 
ctccagaccc 
agacccagca 
ccagcacccc 
caccccgacc 



ctgtatcacc 
tgattgcctg 
gcaaattagc 
tcaagcgcct 
ttccactctc 
gcctccggcc 
ttccattcat 
gacaatgagc 
gagtataggg 
gtcagcattg 
gtttttgacc 
aggggaatct 
gcagtggcct 
gccacctcac 
gcggaatagc 
aatggtgcag 
agagcttggc 
ccagtttagt 
ctgggaatat 
aggtgcccag 
taccaagcta 
taaagcagcc 
gctgcagctt 
cccctgggca 
caatctctct 
gagcaatcga 
agctgcactc 
taaacacagg 
gggctcccat 
gtgagtagaa 
tgcttgcagg 
agtcctccag 
ctgaaaccat 
ccatcttggg 
ccgtggctca 
cacttatcat 
ccccccgcaa 
agggaaagtg 
ctagaagctg 
gaccttgctg 
ctgttccatc 
actcttgagc 
gacctcttgg 
ctgatgagtg 
accttctcaa 
ggggcctgga 
agagccttca 
aaggaagtga 
gggcaagggg 
aactggaata 
acctttcaaa 
tctttaatcc 
ctattctaca 
ccacccaaat 
acacacacac 
ccaggaatga 
tttatgtgca 
ctgttgcata 
atgcacataa 
cctgcagcac 
agcaccccag 
tcccaggccc 
gacccagcat 
cagcatccca 



159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
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ggcccagcac 
cagcatccca 
atcccaggga 
cagggacagc 
tggaggcagc 
cctcattgct 
tcccagtacc 
gtctggggtt 
cagcacatcc 
tggagcctac 
tgggagtcat 
gtatgggaat 
ctgcacggga 
aactcaggat 
ccagggctaa 
tctgtcatcc 
ctacttctcc 
aatcttctgc 
tcccttttac 
tttctncncn 
nnnnnnnnnn 
aaattgaact 
ccttgtgatc 
tagtggttct 
aggcattgct 
tcttaagatg 
agcatgtgct 
acaaaaccaa 
attgggacac 
gaggatgaaa 
aaaatatata 
ttgactgtat 
cccaaaattc 
caggaatgct 
acacttccag 
atgtccttgg 
aagtatttat 
ctaggcctag 
cagcctttag 
tgattcagtc 
gtcttttctc 
tctaatctct 
ttctctcttc 
tgtgccgctc 
ctgtgctgca 
ctctcttctc 
tttgtctgcc 
tttaccttca 
tgagagtaaa 
tctccataaa 
ctgccatgtg 
aataagcatt 
cgactacatt 
agtgaatttt 
ttatattaag 
tttcttcata 
cttatggaaa 
cctcccatct 
tgaactcctc 
gagataaggg 
catgacccct 
ctagtgatgt 
agcctatgaa 
cgaccccaga 



cccaggccca gcatccaagg 
ggcccagcat cccagggaca 
cagcacccca ggcccagtat 
accccaggcc cagtatccca 
acatactgaa gatagggaat 
ttcagcacct gctctcctca 
aggattcccc tcagctgagg 
ggcagcccca tgctaattgg 
tgcccattag gcctaacctt 
cctgtatagc tctgtattga 
ttcacctagg caactccaga 
aggtaaggag agcatagtga 
gattgtgtca ctgtggttcc 
ggtccagaat ccagagctct 
gacccttcct tggacggttt 
ccatctgtgc ctaacagctc 
cagcatccca tgcccgcacc 
taggtttgag ggccccaggt 
caccaccact aatcctcttc 
nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn 
acacaaatga 
agattaaacc 



nnnnnnnnnn 
tacagaggaa 
agaaagctat 
ggggactgaa 
tatgctaagc 
cctataacca 
caaaaacaca 
acaacaacaa 
atttatacta 
agcccctgca 
gtgtatttat 
atctttgaaa 
ctgagttgga 
ctgatctaat 
gaggcagaaa 
agaggctacc 
ttgactgatg 
ccctggaagc 
gacttgctgc 
cccctgttcc 
ttggcctctg 
tgtctccttc 
catccttctc 
tactctccct 
ctctcttctc 
ctgagggttg 
actcaattag 
ttgtttcaaa 
gatgtgtgct 
taacagaatc 
gtgaagagaa 
ttcttgggag 
ttgccatgcc 
aaggcagcca 
aattgggaac 
tacaacttag 
ctgaagaata 
agagattgtt 
accctagagt 
cctcctaaaa 
tagttacgta 
aaacttgtac 
aaatttggct 
gctctggtct 



cccagcatcc 
gcaccccagg 
cccagggaca 
gggacagcac 
gtctctgagg 
cactcggaat 
aatgggtagc 
actgacagtt 
gcctgcaggg 
ggcactcccc 
tgggcacaaa 
gcctggctgg 
agactgccaa 
taagagagca 
cttggctttc 
tctgtggaaa 
ccagctcact 
tgaggctgtg 
ctcctcttcc 
nnnnnnnnnn 

nnnnnnnnnn nnnnnnnnna 



cccaaggaag 
tgacaaaggc 
tgggtggagc 
cataaagatt 
aaagtactta 
ttctcgttct 
gaacaacgca 
gtaaaacata 
agaaccagca 
ggttcaatgc 
caatggattg 
caaggtgagg 
tcatccttga 
taactgtcat 
ttctagcttc 
tgagatcacc 
gggctcaaac 
aattgctctg 
acactctctt 
tgtaaagctc 
ctcagctcta 
cggtaccacc 
ggcagatcct 
acatcacttt 
ttaaaggtga 
aataaggctg 
ttagggttca 
tgcagagagt 
ttggattaga 
ttgagactgt 
aggctgtggt 
aaaaaagcag 
tttttatagt 
aaaattttta 
cccggaacac 
tcgaaccctc 
caacctcaaa 
gattcccttg 
tttccctgcc 
ggtcgtcgat 
atgttccatg 



gctgtaatga 

ccagcccttt 

gacagctgct 

tttctgaatg 

aattctctaa 

gtttactgaa 

atagctcatc" 

atcacacgac 

cacaatgaaa 

cataaacaca 

gagtgctatg 

agagttcata 

accaactgat 

cctaactgga 

tccttccctt 

cttggaggct 

catacaatct 

gtttcctgtt 

tcctctcccc 

cttggtctca 

gcttgttctg 

tcccggtaaa 

ctgcactgct 

tgtgtctccc 

atcctgtcaa 

caagcaggag 

gtactaaggg 

agccaactct 

caatacgatc 

taaggatgtg 

gtcattcctg 

ggagggctga 

tggactgtta 

agtagaaagg 

tagcttttta 

ctgaactctt 

tcctgaactc 

ccaactaaaa 

atgaaccggg 

gcagaacccc 

cagctctccc 

tctcctctac 

tgctttcttg 



accccaggcc 
caggtccagc 
cccagtatcc 
gtatcccaca 
tggtccttac 
gtgcaggttc 
gcttttgtct 
gcaatttggg 
ggggcaggga 
cccatgccag 
ataagggtag 
cctgagcagc 
ctttcacccc 
ggcacttaac 
tctgtcccag 
tatgagctct 
atgttactca 
catctgtccc 
cctcctcctc 
nnnnnnnnnn nnnnnnnnnn 
caaaggacgc 
atactatttc 
ttttcatatt 



cagggacagc 
cccagcatcc 
gcaccccagg 
cccaggcaca 
cctcttatct 
caaacaccct 
taccattttg 
tctcctgaga 
gtgtgctgta 
caagctatga 
aatctctcca 
gcacctgaga 
gacatcttgg 
gatgctgaga 
tactctgtcc 
acatgaggcg 
gtgtgccctc 
ggtttccctc 
tcctcttcct 



cagaaaagga 
ttgcctttgt 
tggaatgttg 
catatggtag 
gagtctgtct 
gcagtatttg 
agtaggaaaa 
tattaagaag 
ttgaatagag 
taaaaagaat 
gtttataaat 
ctcttaggaa 
agattaataa 
ggaagtttgt 
tctccgcttc 
tactggctcc 
aatccaagcc 
ctttctgaac 
gatgatttta 
aactaactct 
tctttactgt 
cctgcctcct 
ctccacagct 
ttacgtagct 
acctttctct 
tgcctcctct 
tgtgtctctt 
agctagaaat 
aaatatcctg 
ctcttgaggt 
tgacactgtg 
gcgatggatt 
ctagctacgt 
acagttttgg 
tgttgctatg 
cttcacccca 
ttcaccccag 
actgttccaa 
tacattgcca 
ttgtcccttg 
cccttgagtt 
accactaggt 
ctgttgttct 



gagacctctc 
tatctattca 
gaacttctta 
caacagtgaa 
agaacaacaa 
attaaggtgt 
aataaagaaa 
agaaaactct 
gaaaagttcc 
gtaactaaat 
tctgaaggca 
caccagaggc 
ctggctgtaa 
atcagctaac 
tagaatgttc 
tctagctggc 
ttcacaatct 
agcaatcttt 
gtctagtttg 
cctccccctc 
ctcctgtatc 
tccctttcct 
gattcttcac 
acaaaccaac 
tttcagccag 
agtttctttt 
agacagctgg 
ttcagatggg 
acaaaggact 
aatacactgg 
ttagctggat 
cagaaggagg 
accaaaatac 
gaacccgacc 
aatgctttcc 
gaacattttt 
aataatagga 
acagaacccc 
ttactatata 
gcatgagttt 
attaaatctt 



163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400 
164460 
164520 
164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
166680 
166740 
166800 
166860 
166920 
166980 
167040 
167100 
167160 
167220 
167280 
167340 
167400 
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gccttctaca 
tgagtgcttg 
acagcgcgac 
ggtctgatgt 
gctcagtgag 
tgtccaccgt 
cctggtggac 
tcccacctcg 
agaccgtggg 
ctcgtgcaga 
tgtggtcgac 
ctcttgttgt 
tgactctgga 
gaaagggtcc 
cacctgaggg 
aggaaggggg 
tcccacctcc 
cccagtctga 
ttgacgacct 
ctgcagctgc 
ctcggagtcg 
tgccactcag 
agtactggcc 
cagagaaccc 
cttgggatga 
tcctcataga 
acctcgtgga 
gtaggggacg 
gacggcccac 
cagtcttcct 
cagaggggca 
agaaaaagct 
aagcagaaaa 
aagaaataga 
ttttggccgc 
acagggcagt 
attgcaaaga 
tcctaaccct 
taactttgtc 
cagtactcac 
ctggtagtaa 
tgacccactc 
taaccaaact 
aggcccctgt 
aacccaaaaa 
aagcaggaat 
ccacccccat 
ctcatatcca 
caccacttct 
gggaggttaa 
taagctctct 
tttgcctgcg 
agggcggaca 
ccaccctgtt 
agcttaccct 
gtcaccaggg 
ctaaaaaggc 
gcaaaagatg 
ccccacggca 
gctttgcaac 
ggaaagaaga 
cgctagcatt 
tagccagggg 
caaaaaaatt 



ttttgagtac 
agtgagggtc 
cacccagagg 
ttgtgttctg 
accgcgctcc 
ccgttcaccc 
ccctttggag 
tgcccagttg 
ttcaagtccc 
gggtctcaat 
gcagagtcgc 
gtctactgtt 
gcattggaag 
gtggcagacc 
tgcttttgac 
tcaccctgat 
gtgggtcaag 
tgcagccgga 
cctctggata 
cccaccacag 
ccgaggccga 
agcacatgtg 
tttttcctct 
ctctggactt 
ttgtcagcag 
ggcgagaaaa 
cgaggctttc 
cctccttgtc 
caatttggct 
tgagcgtcta 
gagagccgct 
gcaaaggctg 
agtctatcat 
ggagagggaa 
agttgtgaat 
gaaaccgcaa 
gaaaggacac 
agaagatgat 
cgtggagggg 
taacccccta 
attttacccc 
ctttctggtg 
aaaggctcaa 
tgcctgcctt 
tgcagtctct 
ggggttggct 
ttcggtaaaa 
gaggttgctg 
gccggttcaa 
caaaagggtc 
cccacctgag 
tttgcaccct 
gactggtcaa 
tgacgaggcc 
actacagtat 
aactgagagg 
acaaatctgt 
gctcacagag 
ggtacgtgag 
cctagcagca 
acaccaaaga 
accagacttg 
agtattgaca 
agatcctgtt 



ggtctcagtg 
tcccttcggg 
tcctagaccc 
tttctaagtt 
gagagggaac 
tgggagacgt 
gccaagagac 
cgagatcgtg 
acctcgcgtt 
cggccggcct 
cgccgtttct 
tttctagaaa 
gaggtgcggg 
ttttgcgcct 
ttgtcactaa 
cagatcccct 
ccttggaccc 
aagtctggcc 
gactcccaac 
ggaccaatag 
agccccgggg 
agagggccag 
tctgatttat 
actgggctcc 
cttttgcagg 
aatgttctgg 
cccttgaacc 
tatcgccgga 
aaggtaagag 
atggaggcat 
gtagccatgg 
gaggggctcc 
aagagggaaa 
aatagacggg 
gatagacagt 
ggtggcaaaa 
tgggctagag 
tagggaagtc 
actcccgtca 
ggcaagctag 
tggacgacca 
atacctgagt 
gtccaattta 
gtcctcaaca 
tcaggttggc 
aaacaagtgc 
caatacccca 
ggccaaggag 
aaaccaggga 
ctggacattc 
agaacatggt 
aagagtcagc 
ctaacttgga 
ctccatcggg 
gtagatgatc 
ctcctcacag 
caaactgagg 
gcccggaaaa 
tttctgggga 
cctctatatc 
gcttttgagg 
actaagcctt 
caagcactgg 
gctagtggat 



tcttcttggg 
ggtctttcat 
acttagaggt 
tggtgcgatc 
gcggggtgga 
cccaggaaaa 
catttggggt 
ggttcgagtc 
tggtcacgag 
tagaaaggcc 
ggtttctttt 
tgggacaatc 
tcagagccca 
ccgagtggce 
tcgctgccgt 
acattgtgac 
caaactcttc 
catcagcacc 
ctccccctta 
cgagaggggc 
aggaaggggg 
caccaggacc 
ataattgaaa 
ttgagtcact 
ttctttttac 
gagaggacgg 
gccccaactg 
ctctagtggc 
aggtcttgca 
ataggagata 
ccttcattgg 
aagatcatac 
cagaagaaga 
atcgccgtca 
caggaaaagg 
agataccact 
attgccctaa 
ggggctcaga 
acttcctgat 
gctccaaaaa 
aacgagctct 
gccctgctcc 
cttcagaagg 
cagaaaaaga 
taactgcgtt 
ctccggttgt 
tgagcaagga 
ttttagtggc 
ccaatgacta 
accccacagt 
atacagtcct 
tcctgtttgc 
ctaggctacc 
atcttgcgcc 
tcttggtcgc 
aactgagtga 
taaccttcct 
agactgttat 
ctgctggctt 
ctttgactaa 
ctatcaagtc 
tcgtcctata 
gaccctgaaa 
ggcccacatg 



tccgcggctg 
tttggtgcat 
aaggttcttt 
gcagtttcgg 
taaggataga 
acaggggagg 
tgcgagatcg 
ccacctcgcg 
atcgtgggtt 
atctgattct 
ttgtcttagt 
tgtgtccact 
caaccagtcg 
aacgtttaga 
caggcgaatt 
ctggcagaat 
gaaactgacg 
ccccaagatc 
ccccctgccc 
tcagggaccg 
gccagactca 
taatgatctc 
aactaaccac 
tatgttctcc 
aacagaagaa 
cacacccact 
ggactacaac 
aggtctcaga 
ggggcagact 
cacccctttt 
tcagtccgct 
gctccaagat 
gaggcaggag 
ggagagaaat 
taaaataggg 
ggaaaaagac 
aaagcgggag 
ccccctccct 
agacaccgga 
gaccatggta 
tcagatagac 
cctcttgggg 
cccacaagta 
gtaccggttg 
ccccaatgtc 
ggtagaactt 
agctagaaaa 
ctgtcagtcc 
tcgcccggta 
cccgaacccg 
agacttaaaa 
ttttaaatgg 
acaggggttc 
ttttcgcgct 
ggcggcctcg 
cttggggtat 
ggggtatacc 
gatgatccca 
ttgtagactc 
ggaaggggtt 
gtctctaatg 
tgtggacgag 
aagacctgta 
tctgaaagct 



tcccggggct 
tggccgggaa 
gttctgtttt 
ttttgcggat 
cgtgtccagg 
accagggacg 
tgggtttgag 
ttttgttgcg 
cgagtcccac 
ttgagttgct 
ctcgtgtccg 
cccctttctc 
gtggaagtca 
gtaggctggc 
gtttttcagg 
ctcgtccaat 
gtcgcggttg 
tatccagaga 
caacagccac 
gcgggggaga 
acagttgcct 
attcctttac 
cctcccttct 
catcaaccca 
agaaaaagaa 
gccctcccta 
accgcggaag 
ggagccgcta 
gaaccaccct 
gaccccttgt 
cccgacatta 
ttagtaaaag 
agagagaaga 
ctgagtaaaa 
ctcctgggca 
caatgcgcct 
cgatccaagg 
gagcctaggg 
gcagaacatt 
attggagcca 
aaaaatatag 
cgcgatctgc 
agctggggaa 
catgaagaac 
tgggcagaac 
aaagctgatg 
ggcatccggc 
ccctggaata 
caagacctcc 
tacaatttat 
gatgccttct 
agggacccag 
aaaaattccc 
cgaaaccctc 
aaggagctgt 
cgagtttcgg 
ctccgagggg 
tcgccaacta 
tggattccag 
cctttcaagt 
actgccccca 
agagcgggtg 
gcctatttgt 
attgcagcag 
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167520 
167580 
167640 
167700 
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168000 
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168360 
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168480 
168540 
168600 
168660 
168720 
168780 
168840 
168900 
168960 
169020 
169080 
169140 
169200 
169260 
169320 
169380 
169440 
169500 
169560 
169620 
169680 
169740 
169800 
169860 
169920 
169980 
170040 
170100 
170160 
170220 
170280 
170340 
170400 
170460 
170520 
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170760 
170820 
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170940 
171000 
171060 
171120 
171180 
171240 
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tagccctgct 
cccctcatgc 
cgaatgacac 
atcctcaacc 
atggacatcc 
agagctccca 
ggagctgcgg 
tcggcacaaa 
atcgttaata 
atctacaggc 
attctggccc 
ggccacccaa 
aaaacaagtt 
tgaggatggg 
tttttgaaag 
accaattcac 
aagtccccaa 
tcactaatgc 
gagtgtattg 
tgttagtatt 
ctgcccagat 
aggtaatcgg 
ctcagttggg 
tagagaagat 
gaaaagactg 
actccttttg 
gtttccgact 
gtgattagga 
gtaccccacg 
cttgagccac 
gttgacggaa 
gaagaaaacc 
ctgcgccgca 
gtcctgggag 
cctctggatt 
caattgggac 
cccaaatggg 
agctgcacat 
cgggggggca 
ctggaagccc 
gtcaaacgaa 
cagagcaccc 
cgagaacgga 
atacattcca 
ggcagtgaca 
tggctccccc 
taccctccct 
taggaacaga 
ctaaccctaa 
aaggaatagc 
gaaaaaacag 
tacctcagga 
gtcagtatct 
gtgtgtctat 
ccagactctt 
gaaaaagaga 
gagtaggtac 
gcagttatgg 
ttaacttccc 
aaaaaaagag 
ggagtaatca 
agaaaaagcc 
ctcctctcca 
tgcatcctta 



gatcaaagat 
cttagaaagt 
actatcagag 
cagctaccct 
tcgctgaaga 
gttggtacac 
tggtagacgg 
aggctgaact 
tctacactga 
agcgagggct 
tgttagaagc 
aaaggagaaa 
gctcaagggg 
aggtataaca 
aaaaatagaa 
ccacctggga 
cctgaagtca 
aactaaagcc 
ggaggtagac 
tgtagacacc 
tgtggccaag 
gtccgacaat 
catcgattgg 
aaataggacc 
ggtggctctc 
aagttctgta 
ctgaccctgt 
cccagatttg 
ggttccgagt 
ggtggaaggg 
tcgcctcctg 
acgacgacaa 
ggcgccactc 
gtgcttaatg 
tggtggcctg 
ctctcagatc 
atagggagca 
ttttatgttt 
tcagattact 
tcctctaaat 
ggagaaagaa 
tcaggaccat 
aaacaacacc 
ctaagagatc 
ctcgtagggc 
aaaggtcccg 
aggaactaat 
ggaccgtctg 
tatgactcaa 
tcagatcagg 
aaagttgact 
taaagggcac 
ggtgcctccc 
gtctgttttt 
gtatcatgat 
acccgttacc 
aggaaccgct 
atattgatct 
tgtccgaagt 
gactctgtgc 
aagattctat 
aacaaagatg 
ctatagcagg 
ataagttagt 



gctgacaaac 
atcgtgcgac 
cctgctgcta 
tctccctcta 
aactgggacc 
ggacggcagc 
gaaaaaggta 
tatagcgctt 
cagccgatat 
attgacctcg 
catacatgca 
acttggtggc 
ccatgatctt 
taaaagagct 
ttaactcccg 
gttgaaaaaa 
gtggctcaaa 
tacagagaac 
tttactgaag 
ttttcaggat 
aagatccttg 
ggaccagcct 
aaattacact 
ttaaaagaga 
cttcctcttg 
tggaggacct 
cttaccctcc 
ggaccaactg 
tggagacaaa 
accctatttg 
gatccacgcc 
ttggacagtg 
tagacctagg 
aaaaggaaaa 
atctcacgcc 
atactgatct 
catatgggtg 
gccctggtca 
tttgtggtaa 
gggacctaat 
acccctataa 
gcaaagataa 
gtctaagttg 
ctgggttcat 
ccaacaaggt 
actgtaccag 
actctcctca 
gtcagtctag 
tcatgctggt 
acttataata 
ctagcagcag 
ctctgtaatc 
ctagacacag 
aatagttcca 
aatagttctt 
ttaactttgg 
gccttaatta 
tagaactata 
ggtgctgcaa 
tgccttaaaa 
ggctaaactt 
gtttgaaagc 
acctttaatt 
agcttttatt 



tgacaatggg 
agccacctga 
aatgagcgtg 
acaaatgatt 
agaagtgacc 
agtttcctga 
atttgggcaa 
atacaagccc 
gcttttgcta 
gctggtaaag 
cctaaaaagg 
caagggcaac 
aactgaaaaa 
atggtagacc 
aagaaggaat 
tgatgagact 
agattataga 
ctggaaagag 
ttaaacctga 
gggttgaggc 
aagaaatcct 
ttgttgccca 
gtgcttaccg 
ccttgactaa 
cgctcaaaca 
ccccccttaa 
tctttgctta 
aaagcagcct 
gtcttggtca 
gtgttactga 
tcccacgtca 
gcagtcactg 
gaaccttaac 
cattgtatgg 
tgacatctgt 
tagcaaccca 
ttcggggcag 
gggtcagagc 
atggacatgt 
cacggtaaaa 
atatcaagag 
atactgtaac 
gcttaaagga 
tttcacgatc 
ccttataaaa 
ctccaccaac 
taaagcctac 
tccaaggagc 
tatgctatgc 
ctacttcaga 
tttcaggaag 
agacccagaa 
tgtgggcttg 
aagatttctg 
ttttagataa 
cagttctatt 
agaccccccc 
gaacagtcta 
aatagaaggg 
gaagaatgtt 
agagaacgcc 
tggtttaata 
acacttatgc 
agaaaaagga 



acagcaggtg 
cagataagat 
taacctttgc 
ccgtcccagt 
tgactgacca 
tagaggggaa 
gcgctttgcc 
tccgagaggc 
ccgcacacat 
acattaaaaa 
tagccatcat 
cgaatggcag 
ggtgatccgc 
agtgatcccc 
aaaatttgta 
aattaaaaat 
ctcctgcaaa 
acaacgggga 
aatgtatggt 
atttcccact 
gccaagattt 
ggtaagtcag 
ccctcaaagc 
attagccatt 
cccctggtcg 
tggaagctgg 
• ttcatttaaa 
ataccccagg 
gacggcatcg 
caacccctac 
agagggccgc 
acaatcctct 
cctcatgctc 
gcaaccactg 
aagttagcgg 
ccccctgagg 
ttctaccgag 
aaaaggcttc 
gaaacgacag 
cgaggtagtg 
agtgggtgcg 
cccctacgta 
aataggtggg 
agattgacag 
caggggcccc 
tccacagccc 
cttggcttcc 
ttttttagtt 
ctctagcccc 
tcattctcaa 
agggctttgt 
catccagtct 
caataccggt 
cattttggtt 
atttgaacat 
aggattggga 
aatactatga 
taaccaaatt 
aattagactt 
gtttttatgt 
tagatatacg 
agtccccttg 
ttttgcttac 
taaacgcagt 



accgttgtag 
gacaaatgcc 
gccccctgcc 
acatcaatgt 
accctggcct 
gcaaaaggct 
tgaaggaaca 
taaaggtaag 
ccatggggcc 
caaagaaaaa 
ccactgcccc 
acttagtggc 
ccaaaagccc 
tcccatactt 
aaaggactac 
tcccgatacc 
ccatgtgcat 
gaccatcctg 
aacaagtatc 
aaaacagaga 
gaaatcccta 
ggcttggcca 
tcaggacagg 
gagaccggca 
tttcgggctc 
tggaacatta 
ggccctaaaa 
gaccaccgca 
aaccggtagc 
tgcggtaaaa 
cagtcaagat 
taagcttcgt 
caattcaaca 
cagtccatcc 
caggatcccc 
agcggtgtgt 
ctaatcttag 
aacaaaaatg 
gagatgctta 
gctatgataa 
cttttaaaaa 
taaggttcac 
gttggcgagt 
tgagagaccc 
ccagtcgtac 
aacacagtgg 
ccaccgcccc 
ctaaatagaa 
ccttattata 
tgcctttggg 
ctgggccggg 
agcaaaagcg 
ctcactcctt 
cagcttattc 
cgggtccgct 
gtagcagctg 
agaactacgt 
agaagaatct 
attattcctt 
tgaccattca 
taaaagagaa 
gctcaccact 
ttttgggccc 
ccaggttatg 



171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
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172980 
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173460 
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173700 
173760 
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174060 
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174360 
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174840 
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174960 
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gtactaaggc 
tatctcctaa 
cagaacccga 
agaatgcatt 
tagaacattt 
caaataatag 
tgacagaacc 
ttttactata 
gtgcatgagt 
ctattaaatc 
tgtcccgggg 
cagaggaaaa 
agacaagatg 
tgtgtggtgg 
tacctctgtc 
cagataggaa 
ctcaatagga 
gctggccagg 
ctctgtgagt 
aggagcctgt 
tttaatcccc 
attggagcta 
agcttatctc 
cctccacaca 
cagctgcaca 
tattagtttg 
ccttaaaaca 
aagagtactt 
agcaatagcc 
atgagcacca 
ttcagccttc 
aaggctaaac 
ttgtagttat 
ccagtcacct 
ataatggtat 
agccatcgca 
attataaatc 
ccatcctaga 
ggacattgag 
tgttggacta 
gggtgtgtgg 
atttcgaggc 
tgtttcaaca 
taatacatgt 
ttctgctagc 
ggtcctcctg 
ggttttggtc 
catctggcaa 
tgtgagattt 
ttttcattcc 
tgtcaatcca 
ctagggtggt 
agaaaatggc 
gaccttgctg 
ccttaggagt 
accaaatgtg 
aaaatagatt 
aactggacaa 
cataatgagt 
ctcatgtagt 
tgcacgctgg 
acagaatagc 
gaggaagagt 
gttaattgac 



aacaatatcg 
aagaagtggg 
cccctcccat 
cctgaactcc 
ttgagataag 
gacatgaccc 
ccctagtgat 
taagcctgta 
ttcgacccca 
ttgccttcta 
cttgagtgct 
atgtattctg 
gaacagctca 
acaggaaaca 
cgtcaggccc 
ccaagtatca 
gtgtgttcca 
agagctgcac 
tcaaggcaag 
ttcaaaacaa 
atgacctcaa 
gtgagatggc 
tggggacaca 
tatgcctccc 
tggtcatgca 
agatcgcata 
caaaaatcaa 
tgtagtaagg 
ttccgagtgc 
gggtacactg 
ctaactgctc 
tactgctcag 
gcagaattca 
cttaaaggtt 
tccaccctaa 
ccagcccatg 
tgttggtgag 
ctgaagacca 
cctcatggac 
cccagacaat 
cacacacctc 
ctggtctata 
aaaccaaaac 
gggtaccagg 
ctagtgaagc 
agctttcctt 
ttaagaggtt 
atgtcttgac 
tgctcctcac 
tgagaactgg 
tcacaccaga 
cactttgctg 
taacgactgc 
cagtctgaag 
attgcttgct 
aagaaatatg 
tattacagag 
atgttggtac 
tagtttgctt 
ctatggtctc 
caatgtaagc" 
tctgtgatga 
ctcagtgact 
atggaaagat 



ggtccttcag 
gaatgaagaa 
ctagagattg 
tcaccctaga 
ggcctcctaa 
cttagttacg 
gtaaacttgt 
aaaaatttgg 
gagctctggt 
cattttgagt 
tgagtgaggg 
cctcatgggt 
acctgctggc 
gagagctgcc 
tacatcttaa 
aaccaccagt 
ggatgtggac 
acctttaatt 
cctgggctat 
acaaaaccaa 
atagccaaac 
tcaacatatg 
cataatggga 
acaaataaga 
tgcctataat 
gcaagcagta 
ttaagcaaca 
acaataccaa 
tcagggatgt 
ctctcagagc 
aggcattcag 
catggctgtg 
agagttatag 
ggatgcacca 
tccccaaaga 
gccttgaact 
gtattccttt 
gcagctctcc 
tcagccgcta 
actatgtaag 
tactcccagc 
gagtgaattc 
cgtaaattca 
ggttgaactc 
tgaggcaggt 
tgagagccta 
ctggacttgg 
aaaggagatc 
cactgcagga 
ggaaaactga 
cctaaccaac 
ggctcatcct 
taagaagcag 
tctttaaaat 
gtcacctatc 
gcatcaatac 
tgcaagggaa 
agttcatgaa 
ttttttttct 
aaactcacca 
atgtgccacc 
ttaacttcaa 
gggtgggcta 
caagtccatt 



gaggttgaaa 
taaaaatttt 
ttcccggaac 
gttcgaaccc 
aacaaccgca 
tagattccct 
actttccctg 
ctggtcgtcg 
ctatgttcca 
acggtctcag 
tctcccttcg 
tcagggggtt 
aggagggtgt 
tacagtctta 
aggatctaca 
ttgtagggga 
aaggagaaca 
ccatcactcg 
acaattctag 
accataaaaa 
agctctcctc 
aaagtccttc 
gaggaccaat 
aaatatatat 
ccagcactcc 
ggctagacag 
ataacagtaa 
aaatgttcct* 
ataaatactt 
tgcccagaaa 
aggtcacttc 
gtacttggca 
catcatgaaa 
agtgcctttg 
cttctggcca 
cttgatggtc 
gctgataata 
aggagtcctc 
ctagattcgc 
ccaatcccat 
acgcaagagg 
caggccagcc 
ttctatcagc 
aaagtcttca 
acagccggtt 
caaagaactc 
atctgtagct 
acttgtgttt 
ctgcagatct 
ctttgtattt 
tgccaagagt 
catccttggc 
tctttccttc 
gtgtttgtta 
atattctatc 
ccactgcaaa 
aagaaaaaaa 
gaggttctac 
ctttccttcc 
ctgtaaatca 
aggcctggct 
tcatcaactt 
agggcatgct 
gtgagcagca 



actcgctcta 
tactgaactc 
actcctgaac 
tcccaactaa 
aaatgaaccg 
tggcagaacc 
cccagctctc 
attctcctct 
tgtgctttqt 
tgtcttcttg 
ggggtctttc 
tccctcagca 
gggaaaggac 
caggcctacc 
gtttattaaa 
taaaaataca 
cagttgttta 
taagagggaa 
attagccaga 
agcatttctg 
agtccaaacc 
ccaaaaatat 
ttctacaagt 
aataaaaaga 
agaggctgag 
ggctacatag 
caaccacaac 
ttaaggacag 
agaaaacttc 
gttgtttatc 
tgtagtagcc 
ttatcatttt 
gtttccacca 
gggtgataaa 
tctcataatg 
ctgtctcagc 
caagcaaatt 
aatgcagact 
aacctattca 
tttaatacac 
cagaggcagg 
agggctacac 
tctatttcct 
tgttt'acgta 
ctttactgct 
tttttttctt 
gtcatatcac 
gctgcagtgt 
attctgcctt 
gggctttgaa 
tctgctgact 
ctgcagttta 
cagaaatttt 
tggtagaata 
aggaagcaga 
aagtgtaaat 
tcagccagtt 
aaaatggctg 
ctttcctttc 
ccttgaactt 
cacacatttg 
gacataacca 
cataagggat 
acacgccctg 



agattagagc 
ttcttcaccc 
tcttcacccc 
aaactgttcc 
ggtacattgc 
ccttgtcccc 
cccccttgag 
acaccactag 
tgctgttgtt 
ggtccgcggc 
aaaactactt 
aattcaggga 
aagtgttcat 
accactgacc 
agaacactac 
aggaacacat 
aaagcttaac 
gcaggttcat 
gctacatcgt 
aggctttggg 
aaactgcaaa 
tgacaactgt 
taccctctga 
aagaagtcta 
gcaggaggat 
tgtaaacctg 
aaaaacccaa 
ttctggaatc 
ccctggagaa 
ctggattcat 
aatgtctaaa 
gtgactggtt 
agttcctgat 
ttatattcaa 
taaaatgctg 
ctgtgtttgg 
cttcaagctt 
ggcccagctg 
gacaagccac 
atattcatct 
cagatctctg 
agagaaaacc 
tagagaattc 
gcaagtttcc 
ccttgcaaat 
taggtctcca 
agacattcaa 
cccttctggc 
tttagttgac 
tttgtccatt 
tttgttttct 
tccccaggaa 
agtctatcta 
ttttgagttg 
cgtcccattt 
aaataataaa 
tcagaattgt 
ggggtgggaa 
cttacaaggt 
ctgatccttc 
gtttttcaat 
agaatcgtct 
tatcctgatt 
aacagaagtc 
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ttctgaagta 
cgtgtttatt 
aacatgaact 
tatgctaaga 
agctttggaa 
atctgatttt 
agactcagaa 
tgtcattgtt 
ccaggctaga 
tttgagacag 
ctggcctcga 
gtgccaccac 
tgttcctgcc 
tagtaaacca 
cccaaccatt 
tttttcagct 
ggttctgatt 
tgcggtaagc 
ccaggtttgt 
cattggggca 
gatttggcaa 
caacacaaca 
aagagcaccg 
tcacaaccat 
tgtactcaca 
atgtacctca 
cattttctta 
cttgacacag 
ctctggtggc 
agaagcaaag 
tacatgcatg 
tcattaaaaa 
gctttgaaag 
aaccttttga 
taatggcagc 
tagtgttttt 
acctttgtgt 
ggcttacaca 
ccagctcact 
gggttcatgg 
ccatccttag 
ttagtttgat 
aacgagtttt 
agttatgatc 
tattgcctga 
ttgactggtt 
cctccagact 
tgtgatcatg 
gggtggaaag 
agaaatgtca 
tttatatagg 
aactctaagc 
acccttcaag 
tacgcacaag 
acacaactta 
acatgggaag 
nnnnnnnnnn 
nagagagaga 
gctgagtcag 
cgctgtccca 
gctgcctcat 
cagctgggcc 
gtgctttttt 
aatccttgca 



taagaggaga 
ctctgtctgt 
gcaccctgga 
tattttatcg 
ctgaacctaa 
gatggtaaaa 
aagaagttag 
ctgaatgtaa 
atctgaagga 
ggtttctctg 
actaagaaat 
accaggctag 
aatgtaccaa 
gaataactta 
tttggaattc 
tttttcaagt 
agcatttcaa 
agccacacac 
ttctctatcc 
cgtgcaactc 
tgtcaccctc 
cctctcatct 
actgctcttc 
ctgtaatggg 
tatattaaat 
gagtccaaat 
ttgaggttac 
cagacaagca 
acttcctgcc 
gacctactct 
cacatggtac 
gaaaaatcag 
ccaagaagtg 
caaaactaaa 
agtgggatta 
gtgtttattc 
gggtcctaga 
gtaggtggtc 
ggccttcttt 
ccagtgctct 
ctgtggtcct 
catctctctt 
ctacttctgt 
ctcacagtcc 
ccctgctcgg 
ttgttccaat 
gcagggatta 
tggtgctcag 
attccaagaa 
gggacatttc 
ttaacctgaa 
aactaaggaa 
ccccacccaa 
taacatcata 
aaaagagagg 
gttggaggca 
nnnnnnnnnn 
gcctattatg 
aactagccta 
gatttgcctt 
gctaatcagt 
agagccacta 
ttttttttta 
ataggctgca 



aagcttgatg 
tcttgaccgt 
attgcaaggc 
cagcaataga 
ggctgttata 
tcatgtgttt 
gacttgaatt 
gtgcctcagc 
actcctctat 
tgtagccctg 
ctgcctgcct 
gaactcgtct 
atagtcaatt 
tctagccaaa 
tgtgagggga 
tgcaggtcat 
aatgaggcaa 
tcttgcttgg 
taggcctctc 
aaacacggag 
ctcccttctc 
caagaattgc 
tgaaggttct 
atctgattac 
aaataaatct 
gcttcttcct 
tgcttctctg 
gttctttaaa 
atgactgatg 
ccaaagccgt 
acacacatac 
ggctcagcaa 
gatattaatt 
aaatataaca 
gtttcttttt 
atattccaca 
gattgaactc 
aggcttgtat 
gtttaaaaaa 
ttatcttaaa 
aggagaaaaa 
tgaagatata 
tcaacaaact 
ggtgggtcag 
tgcatccaca 
ttttagtata 
atgttcttga 
cactcttgca 
ccagaggaag 
tagacctctg 
aggggaaaaa 
tgttgggggg 
gctggttatc 
tggatgggca 
ccatgaattt 
gnnnnnnnnn 
nnnnnnnnnn 
tcgttggttg 
gctgagacac 
tgggggccct 
tgaggttgtg 

aggggagtct 

aatgttccct 
agacaggcaa 



agagcaagca 

agatgtgatg 

aaacaaacct 

aatgaaactt 

attcactagg 

gtttctggat 

ccgttttaga 

tgaaaataca 

tttttgtttg 

actgtcctgg 

ctgcttccca 

attacacatt 

gattcctgtt 

gtctgcctat 

atccacagat 

gattcagtgg 

gcagagggca 

aggcttagtc 

tcttaaattc 

tgtgactcct 

cctagggtca 

cattagggct 

gagttcaaat 

ctcttctggt 

aaaaaaaaaa 

cccctgacta 

cttctaccct 

gcagggctca* 

atctaaggtt 

cctctgacca 

acagaagtaa 

actttccgtg 

atagttattc 

gagtgtattt 

tagattattg 

gcatgtgtgt 

aagtcatcaa 

ctttggaagg 

agaaaaaaaa 

atcaactgca 

catacagttg 

atcagctcac 

acccaagctg 

aagcctaagc 

ttcacatcct 

tgtgctgctg 

ctgccacaga 

cccaaggctc 

aagaaagtca 

aagtctcaag 

attcttacag 

ggggtagtct 

caaaacaaac 

gattgcattt 

aagagagagc 

nnnnnnnnnn 

nnnnnnnnnn 

cttctaatca 

tgtccaccca 

tgaaatgaaa 

aaaatagccc 

ggtcctttgg 

gctttaggct 

gaatatcatc 



ggcaagcaag 
gctgtcttgg 
tttcctcttc 
agaacaggcc 
atagggacca 
atgatagatt 
gacattccag 
aagagtacag 
cttgtttgtt 
aactcacttt 
agtgctggga 
aacacccctc 
tatttaccac 
tagccatatt 
gctgtagacc 
gtcatgaaat 
tattgtcaca 
agtttctggc 
caaacatagt 
ttccccatct 
gttttacctc 
ggtgagatgg 
cccagcaacc 
gtgtctgaag 
aaaaaaagaa 
cactcacgct 
ggctccttct 
ggaccagtga 
aagcctagaa 
ccatgtgtaa 
aaagagattt 
tagaaaacta 
attatagcag 
tttttgtaat 
ttattatttt 
ggaattggat 
gcttgcacag 
caagcatttt 
agtccttttt 
aacttttatic 
gatggcttta 
atcacactca 
agcagctcca 
gggcgtggct 
ttcctggtga 
aaacaatctt 
gcactaatat 
ggggaacatg 
gaggtgagac 
aacaaggcct 
gggtccaacg 
tccccaggga 
tggtcagtcc 
aggaatacac 
aaagcaaagt 
nnnnnnnnnn 
nnnnnnnnnn 
ttagaaaacc 
ctgtcccaga 
gtcaccagca 
tgcagtggtt 
agcagagtta 
cagtgctgtg 
cctgttttgc 



ccaggatcca 
cttcctggga 
caagttgctt 
cataactgcc 
ctggaagtga 
tatcaatttg 
agaaaactga 
ggaagaaagc 
tggttggttt 
gtagaccagg 
ttaaaggcgt 
tttaattaac 
atgtttctgt 
ttcatcagtt 
gctttagaca 
taatttagtg 
gcacagcaca 
tctaaacgcc 
tagacattac 
gcggttccca 
tcacactcca 
ctcagaggtt 
acatggtggc 
acagctacag 
ttgccattaa 
ggcctgagtc 
gctgcctatc 
gactgatcgg 
cccacgaggt 
actgcacatg 
aaattgaaaa 
gagtacttag 
agatttctaa 
gtaagtttac 
tattaattat 
ttctgcttcc 
taggtggtca 
acttcctgtg 
tgtttaatta 
tggtaaaaag 
tcctgcaggc 
agcctctgcc 
aacaacagcc 
acctcgctgc 
gtgtggttct 
tttgcctctg 
ttactgaaca 
gaggaagagg 
tgcatctcct 
gaaagtctta 
ttagacaaag 
acactcctct 
tgaagccata 
acatacacac 
gggaaggggt 
nnnnnnnnnn 
nnnnnnnnnn 
actctcttag 
gcaaggccat 
ggctccggaa 
cctgggcctg 
acagtcatca 
cgctacttct 
cctcaggcaa 
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attctgaagt 
attctcactc 
ccacctccca 
gagaatgaag 
tgaacataaa 
gagaaatgtc 
ccccaaacag 
actcatgttc 
gcaccaggag 
tcccacagtg 
cctgggcttg 
gggacgtaga 
aatgccagca 
acagactgag 
agatagtggg 
acaaaaatgg 
tttgtcacct 
gttcccagca 
ctacaaagta 
caatcaatcc 
cttccagagg 
ctccaatccc 
agcatacaat 
gtacagaaag 
tttcctgtct 
tggtgtagtc 
aacaggatca 
ccccggctgt 
caactgagga 
tctggaattt 
gcagaactgc 
gggtctgtgc 
gaaaaaaata 
gaggcaggac 
cctctgatca 
tccattagag 
agtatggtag 
agaagttcag 
gttttgtctt 
tttgctgcca 
atgacgcttg 
actaagtgga 
tcgttgggaa 
gaaagagcca 
ccaaagctgc 
cacaggaggc 
atgaaacgtc 
gggctggcag 
tgcttttcca 
gctggacttt 
ctggctctgc 
gagaggtttg 
ctcaaaggca 
ccaagcccta 
agggggctgg 
attccgttcc 
gaatctgttg 
cgcaggcaaa 
tcatgccagc 
cagcctgagc 
aaaacaaaaa 
caatggatcc 
ttctccatag 
agagttgagg 



ctggcaaatg 
tacttctgcc 
tgctcagggg 
tagaagccag 
gttcagttac 
aagtgctaca 
atacaaacct 
cagcccagtt 
gaaatataat 
actgtggata 
agatctagca 
ggcagaagga 
tttgagaggc 
ttccaagaca 
agagaattca 
agcatgaatg 
acataccaat 
tctgggaggt 
agttctagga 
atcagtcaat 
acctgggttt 
aagggacatg 
atatatatgg 
agatcaacta 
gacatctgac 
cctgtgaccc 
aatcccacag 
ctgtgcagca 
gtgcctcagt 
gtcatttgat 
agaggaaaag 
tttctcttac 
gatgcatcct 
tcatcatgca 
aatcagtttc 
gcctgcatgc 
tatctgttta 
agttatcctt 
aacaaaacga 
agtttgatga 
aactccacgc 
taaatgttaa 
gggaggggaa 
gtgtgccacc 
ccagtgtgcc 
cagcaggagc 
tttccctttc 
ggctcatcca 
aggtcttagt 
actctgctgt 
caaccaccat 
gtctggtcct 
agtgctggtc 
gaggacagac 
agaaatagct 
taacattttc 
ccctctctgg 
acactcatac 
gatgctctca 
tatagaatga 
caaatctgtg 
ctagagctct 
accacatttc 
tttacatggt 



aaagatgtgg 
tgtgccacct 
ctaggaagtg 
cttcaaatct 
atttggttta 
attcagtgga 
tcatcaaagt 
ggctccttct 
agatgtcctt 
cagcccagtt 
ccctaaggca 
tcagttcaag 
agaaacagat 
gccaaggcta 
aggttatctt 
cttgcaactg 
tatacaaatc 
agagatgggc 
cagccaaccc 
catgggctgg 
gattcccagc 
acaacttcta 
gtaaaatgct 
taatgaaaca 
ttcttgcgtg 
aggattaggc 
tgctccactg 
ccgcccagat 
gctcctcagg 
atttccagac 
gggcactact 
aaatcattgc 
cacagtacag 
gcctctttcc 
aggcctggaa 
acgaagctct 
gcatggtgtg 
gcacttacag 
aacaagaggg 
cctgagtttg 
acatgccaca 
aaataaataa 
aaggcaggcg 
aaagctgccc 
accaaagctg 
agccaggttc 
ttcagaccac 
ctctagtgtg 
gtctaattaa 
aaggactttg 
gtcagtgggt 
tgtctaatgc 
cagcatgtgt 
atgctttaaa 
cagcagttaa 
atggtggctc 
ttttctcagg 
atatgaaaca 
gcactcagat 
gatgctgtct 
ggcttaatca 
gctttgcccc 
tattttgggg 
catttttttt 



gatttgaaca 
tcctctcatg 
gggagaagat 
aaactaagca 
aaaaaaaaaa 
tgtggatgaa 
ctcttccaaa 
catgtgagct 
tttaaggggg 
agtagagttc 
tagtggtaca 
gtcagatgag 
gaatttgtga 
cacagagaaa 
ggactgcata 
tggacaatat 
agattcagct 
agagctctat 
tacagagaga 
agagatagct 
acccacatgg 
ctggtctctt 
atatatataa 
accataacac 
acccagtgcc 
aaccactgcc 
taccccgcca 
gcttggtgtc 
gcagtgtgca 
tccagctgac 
gtgactgtta 
ggcctctctt 
gatggcaggt 
ctcaactaca 
agaaacggcc 
ggatttgatc 
gaggtatacc 
taagttcaaa 
gccggggaga 
gtcccttgag 
gcccacgcat 
atcaaaggaa 
agggagatag 
agtgtgccac 
cccagacttg 
cacctcagag 
agcagtgaca 
cctgtggcca 
ggttagcagc 
ggcattgttc 
cttcagaggt 
aaaatgcctg 
ggaatcctaa 
aaaaagtcat 
gagcactagc 
aaagctgtct 
caccaagaac 
ttttttataa 
ggcagaggca 
caaaaagaaa 
ttcctagcca 
ctggtctgga 
accactctgg 
gttgttacag 



cagacttgtt 
cacggggagg 
ggatgtcctc 
atgttttatt 
tccctacaca 
acaatcaaaa 
ggctgggtct 
ccgacttcca 
gtggggtctg 
tcgcctagca 
tgcccatgac 
gggtggggag 
gttcaaggcc 
ccctgtcttg 
agactttgat 
tgggttcata 
gggcccactg 
gactttaagg 
tacactactt 
cagtgatcaa 
cagctcacag 
tggtcaacag 
aaatcagatt 
atactgtttt 
tccacaatga 
cctgggaagc 
gaggacgtga 
actgagtgac 
attgaaactt 
cttggataac 
tcccctgcat 
cacttccctc 
acgaggcggg 
ccgccgcccc 
actcaggctg 
ttcagcacgg 
tatctctgca 
gctagcttgg 
tgcttcgctg 
cttatggtgg 
gaatgtgtat 
tggccactca 
ataaccctga 
caaagctgcc 
attacagatt 
gtggagccac 
gctgtcctgc 
gaacaggcct 
caaattggag 
cattccgtga 
agaagaactc 
gggcaccagg 
gttcaatacc 
gctttaaaaa 
tgctctttca 
ataattcaag 
acatgtggtg 
acctgctcgg 
gggggatttt 
aaaaaaacaa 
gaaggagctg 
gagcttgctc 
gcctgcacat 
taatgagagc 



tggccaaaga 
ggaggggagc 
aaagcagggt 
tccatttccc 
actggttctt 
tgttgaacac 
gaaaagagcg 
aagactgctt 
tctgacaacc 
agcgtgcggc 
cacagcactt 
gcattccttt 
agcctggtct 
tcaggaaaaa 
tccaaaataa 
catattctgt 
gtgcatgttt 
ctagcctggt 
ctaaatcaat 
tagtgccatt 
atgtctgtaa 
gcatgcacgc 
cacaaatcaa 
aaaagctacc 
gacaagcaga 
cattcctagt 
agcctccctt 
catcggagcc 
gcatgtgatt 
agcacagagg 
gattatagaa 
ctttgaagtc 
ttccgggact 
tgcagagttc 
gggatgtggt 
gcataagccc 
cagggagacc 
gcaacatgaa 
tgctgagtca 
aaggagagaa 
acacacacac 
aaatctacca 
tatgaacacg 
cagtgtgcca 
tggccaggga 
aaacctggaa 
agagtctgga 
cagtcacagg 
agagaagggt 
tcaaatacca 
atcctttttt 
ttaatgtcaa 
catcagagcc 
aattctgttg 
gaggacccag 
tcctagaggg 
caaacataca 
atgtggtggc 
gtgttgagcc 
aaaaaaacaa 
gctccagcaa 
tagaaaggaa 
ggaaaaatgg 
tttgaaatga 
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tcacaaaagg 
aaccgaatcc 
ttatggcaac 
tgcaaagtct 
gtcataaata 
ctaatgagga 
ttctaatata 
taaaatgata 
atccaaagca 
tctcatccgc 
taggacagac 
tattaggctt 
aaaaagcaac 
ttgcataatg 
gaacccggca 
ttgcagggag 
gacctgactt 
ggattgctct 
agcatggatc 
tgcataccag 
aaccacccaa 
ggattcataa 
tacccaaaat 
tttggaaaac 
gcaccaagac 
gccgctcacc 
cacagtcgca 
tttgtgaaac 
gtaattcaaa 
tgcctactga 
taacttccag 
gacccccatc 
agggtcttgt 
gcattattta 
catttcctct 
tatccttgat 
tttctgggtt 
accaacatga 
acaggcatgc 
cagagtaatt 
aatgagttct 
gggtgatgtg 
tggggagctg 
ggtacactcc 
catggcgggt 
aagtttgaca 
ccacaccacc 
caacaagaga 
ttgtgaggaa 
ttagtcggtg 
gaacaccagc 
ccgaattcca 
tggcgctcag 
tatcccctgt 
ctcccctcct 
gaccccctcc 
tttatcaagt 
gtttctattt 
actgagggtc 
accacgcacc 
ctcgcccaaa 
aaaagctacc 
acagggcggg 
tcaatattta 



aaaataggaa 
taaaaggaat 
tgacatttaa 
tgcaggaact 
tagcttattc 
aatatttcca 
atttatcaat 
aatagcaata 
gtgtagctat 
agatacagtc 
gaaggggggc 
tgttataaga 
tacttggggt 
aacaattttc 
caggctcata 
agaaagcagt 
taaacagttt 
caaagactct 
tccggccctt 
gaggactgaa 
actcatttcc 
aataaggcag 
ggaactgcat 
tggcacagct 
ccagggctga 
aatcccggct 
ctatgctctc 
tgttttcgaa 
actaattacc 
ttgtgcgctg 
gctaaatcag 
cctgacttcc 
agttctggct 
atttaggcta 
tcttcaattc 
cgtcccaccc 
tgcaaggtgg 
gcggcgcaca 
acacaccggc 
aggcctaggc 
cttcctttgg 
gctgaaacca 
cagtcggaag 
cagaaataga 
ttttagacat 
gagtgatatt 
gccacccaag 
gggagccggt 
ttgagtgtac 
aagcaggggt 
tcgaggaaca 
cagggcttaa 
caatctcatt 
atctgactca 
cctgcctatg 
actggactct 
gttctgttgg 
cctttacatt 
ctggtagttt 
aagcaaaaat 
gccagattta 
acttggcaga 
atggggcggg 
acctaactgg 



aatatgcctc 
gttcaagtaa 
aagtaacagg 
tcatgctaaa 
ccaaatcctt 
caaaaaccca 
ttaaacacat 
gtggtgaggg 
atttaaatac 
tagactcaga 
aataagaaat 
ttacatcaaa 
ttgcgcctgg 
attcaatcag 
gtttttatct 
cgggaggcat 
ctctctggag 
cggactcctg 
ctcactgctt 
gtttggctcc 
gacttggttg 
aggcagagaa 
tttcatgcac 
gacattggct 
ggcctaatcc 
ggagccgcag 
ccgtggggag 
aacctgggat 
ctgagcacat 
cagctcactc 
gcagccacag 
aggcgaggag 
gccgaattat 
gaatgacctc 
gtggccccca 
cacttggcta 
tcaggtccac 
gccgccatca 
agacatgtgc 
agttcctgaa 
gggtgctaaa 
tttctgcagc 
aggcacaatt 
cagatgaggg 
ttaggaatat 
accgggttta 
ttatcacatg 
ctatctttgt 
aacactccaa 
aactgcgctc 
ccaaaaaggc 
gtattaagcc 
tattatttct 
cattctcctg 
catgctttgc 
ctcaccccac 
tgagccttct 
tcttgtagcg 
tagagttgga 
tgcacccacc 
ttgctcctgc 
agatcaagtc 
tagggtggag 
taatttgctg 



ctaaaaagag 
aaaaaaaaaa 
atttgggcaa 
attatgcttt 
agaatttcta 
cttaccatta 
tgcataatgt 
gtggggggca 
ctcaatccat 
gcagcatatc 
gacaggaaac 
gcagttcata 
gggctgcctc 
gatctcagca 
cccagctcca 
tgtcctagtg 
gttgaaaggg 
ttacaggcaa 
tcttgaatca 
cggtgaccag 
gtcaaatttt 
acggagggtg 
aatagaaaac 
agaggaagga 
gcctttatcc 
aagagctctc 
gccgctccgg 
gatcatttaa 
ttgaaacatt 
tggtgtttct 
gcgctgcctc 
gcggcccgtt 
tgctcttatc 
tttccctccc 
ttttctgatt 
catcttcatc 
cagtcatcct 
ctgagagagc 
acatgcgctt 
gcaaattcat 
ccagcatgcc 
caagcctgtg 
ctgggatcaa 
ctgcctcctt 
aaaagtaggt 
actagagcca 
agccataatg 
ccaaaggaaa 
taacatcccc 
gagcagtctg 
cgattaatga 
ccaaagaaat 
ctacaaagat 
ctcagtaagc 
gtcttcacaa 
atacttggaa 
cttgcattaa 
tctcacatgg 
gttagatgac 
cttccctcag 
tgtaaagtgt 
agaagtgcag 
cgaataattg 
tgacaattac 



ccgaggcaaa 
atatggctca 
gtttctttct 
aattttaatt 
ccctgcaagg 
agaaccccca 
gccactctgt 
aaaaccagga 
tgtagaggaa 
cttgactgta 
ttcagaagaa 
tgttttaatc 
tgtgtactga 
gagatagctc 
cctgctggag 
gctgtgtacc 
gctctgtaag 
gtaaggtcct 
gggatttaga 
aggacaaggt 
caagtttccc 
tgtgtgtgtg 
ttaaagactg 
acggccaggg 
gagggtttag 
ttcacttggc 
gagggggagc 
atgtttaaaa 
tatgccatca 
ataaactgct 
cagccctggg 
tctccagaga 
cgtgttcata 
gagtcttcct 
ggtccaaata 
tgggagccaa 
aaggtgtgtg 
acgtgccctg 
tccccagcaa 
ttcccccttt 
agtggctaga 
ggcagaagct 
gaaatgagca 
attagcgctt 
tggattccca 
ttaagagact 
caagagaatt 
tgagcagccc 
tgcaggattg 
cctgtgtacc 
caaaggacac 
caaggtctag 
ccaacactca 
catcctggtt 
cgacagctgg 
ctactccttg 
agctgtgaga 
gagacaccca 
ccagcaacat 
atgttcctgg 
atcttctcta 
gctagcaggt 
aagctccaag 
gccatgaagg 



ttattatagc 
tgcgaagttg 
ccaccctctc 
agccaaatag 
agctgaacta 
tccgatattt 
agcattccat 
gattaaacat 
aacacactgt 
agggtattaa 
ataaaatttc 
tggggaggaa 
accagacagt 
ctactcaaag 
aaaccttgta 
taaagttaca 
ataccagagt 
agcagatggt 
aattgctatt 
cattgtttaa 
agcagtctaa 
tgtgtgtgtg 
aaccaatcat 
cgagccagct 
tgaggctccc 
tcagtcccag 
gacatcaagc 
tatgcacatg 
tcttggatcc 
tcagcgattt 
ttggtggaga 
gccgtttgtc 
attctcatct 
ccctcattcc 
tatagacaaa 
tgtggtgagt 
agagaggtag 
cagctcaggc 
accctgcttg 
tccagaataa 
agcctgagat 
aaccttgggc 
ctggtttata 
tgaagatgcc 
cagtcagctg 
cttcattatc 
ttcattccat 
agcgtgaagc 
cctctgcgat 
tggcttgcaa 
tcatagaggc 
gccattctcc 
atttcccagg 
tgaaacgggc 
taatttgcaa 
gaactacttg 
aggaaccaca 
ggttagatat 
gccttccccc 
catcttataa 
agcctcactt 
gacggtgagg 
agttaccagc 
gaacgctgcg 
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actatgcaag 
ttaataacac 
cagcatatgg 
ttcaattcgt 
catttgactg 
gcttcccaca 
aactttaata 
gagcgtgaga 
tgagacaaaa 
atgggagttt 
ttagtggttc 
tatggagatg 
gtagaagtgg 
tgcaccccaa 
aaattttagg 
tcgagttgag 
ttttttttta 
tggctccctg 
aaccaccgaa 
tacaagctct 
aattaccgac 
ttaaaagttt 
attagggact 
aattatacat 
aggttatgaa 
cagctccaag 
ggatcactta 
ccatcttccc 
gggtctctcc 
tttttttttt 
aactcacttt 
ctcctgagtg 
tgaagttccc 
tccggctcag 
aacccccaca 
gcaaggctac 
tgtcggaggg 
tgtcctttgc 
gccttggtgg 
atgcctccct 
atttacctgc 
agagaccctc 
gcccatcaga 
gtgattcagc 
gccactttta 
gggtatataa 
cctcacgcct 
actaagtgtg 
ggtgcctggt 
gttatgtcac 
tgctctcgta 
gaactgtaag 
aaagtaactt 
ttgtgtctgt 
ccactgacat 
gtcattcaga 
cctagggtac 
ttgtggatta 
acatcaacag 
cttgaacaaa 
tccggccttt 
ttatttcatt 
ctaaggttac 
tgagatgaag 



aatgttgctc 
acagaatgag 
ctgtgcacat 
cctcagatag 
acaggcatct 
gcccggcttt 
aggattttct 
caggaatggg 
gaatattaat 
tgaggcatgg 
atttatggag 
tagcttataa 
tgggggtggg 
gcctatgttt 
ccaatagaaa 
cagtggtgtc 
aagtccactt 
cagcgagagg 
agggaactgc 
gagcaagcta 
tgggtaatcc 
acacaatcac 
ttttgggtgg 
tgagatttgt 
atggccaggt 
tcataagctc 
ggctacccac 
aacatggctt 
aatgtggact 
tntttggttt 
gtagaccagg 
ctggaattaa 
agtgacctgg 
aattttgggg 
ttgggagagg 
acagcaagaa 
tgtgaggcct 
tccattactt 
ctgcaatctg 
ccccaccctc 
cagcctttgg 
acagaagcac 
agaacccagg 
cccataaatt 
ttgaaataga 
gagataatcc 
gcctgcaagg 
gaggtatgga 
tggtggcacc 
tgggtatggg 
aggttcctgc 
ccacttactc 
gtgtggcagc 
ctgtctgtct 
gtctcagtcc 
ctggttttga 
tcgaccagac 
tttgtgtata 
tatttagcat 
tgaaaatcaa 
taatacagtt 
gcttaaccag 
ccaaccacgc 
aacaatcaac 



tctaattaag 
ccttggctcc 
aggccgtgag 
agcccagaga 
tcccggaaag 
caaccaggaa 
cacagaaaga 
gacacaagcc 
atcttggcta 
ctaagctggg 
ggcttgctgc 
aactactcag 
gtggggcaag 
agggttgatc 
tgacctctgc 
aacggagaga 
gcttgggact 
ctccagtttt 
acagcacaca 
tctgaagaag 
actagggagc 
ttttgagttg 
tttactcgag 
caactgctgg 
aaggttgggt 
aaacaagtct 
ggacaagcac 
ctatgttgct 
caaatcatga 
ttcgagacag 
ctggcctcgg 
aggtgtatgc 
atgtcaactg 
ggacaaggta 
cagaggcaga 
gctgtctcaa 
tgggggccag 
aatcagaatc 
catttaagag 
cctgcatccc 
tgggacacag 
ctgaatgtac 
cttatgctaa 
tacacatccg 
aaagatgcct 
catgttgtct 
tagccaccaa 
taaaaaaaaa 
gtttggggag 
ctttgagagt 
caccatgttt 
tcaggtctct 
cagctaagca 
gtcattccat 
tcttggtatc 
gcttgcaatc 
tgggagtagc 
aagatctgat 
caacttaata 
acagcagtac 
cctcgtgttg 
agttaactgg 
taggaaggag 
ataacaggga 



agggctctgc 
gggagctaaa 
tgatgcagcc 
gcgcggctca 
cctgcgcgtg 
ggcttggcgt 
aaagtccatg 
aacaccccaa 
tgaaggagga 
caaaccattt 
ccggagagcc 
attttaaaca 
agaacaattg 
agcttcccga 
accacggctg 
gcagaggaaa 
cacctgaagg 
cccaggcacc 
agttaaatat 
ctgtcatttt 
aggtagtttt 
actataagta 
gctgcaacca 
ggagtagtgg 
gcttccaagt 
tcaaggggcc 
ttctcataca 
tcaacaacca" 
ctacagcntg 
ggtttctcca 
actcagaaac 
taccacgccc 
aagttggatt 
ccttgattta 
ggcagaggca 
aaccaaagac 
aacaaggtgg 
gccatcacag 
ctaagtggga 
tccatttacc 
gcttagtggc 
tttcagcgct 
ggagccagaa 
tacagccaaa 
attctggagt 
ttgatgtggt 
gccacaccca 
tgtcccaaga 
gcagtggacc 
ttgtagcttt 
cctctgccat 
ttcagtcctg 
agggctgtgg 
ttatatagtc 
atatattcac 
ctcctgcctc 
aggttctgtt 
ggcccgaccg 
aactcatttg 
ctgttctcct 
cggtgacccc 
aagggttaat 
aggaaagggc 
cagagcagtc 



atttcctagt 
ggttccatta 
cagttaagcc 
ggccctcacg 
cctacactgc 
gggtctgatc 
ggaacaaatc 
ttgctaggct 
tggtgccatc 
tctttttttt 
catcagaaga 
aacagtgcag 
catctgcaga 
ggcaagccca 
actgaagcta 
gtccaatcag 
cagggcattg 
agcccatcgg 
aggctgggtt 
taatgacggc 
ggaagaacag 
tttcacacga 
acaatgagtg 
agggtcctgg 
ctcaaatata 
tggagagtta 
aggaccggct 
gggcagggtg 
gggttttttt 
tatagccctg 
ccgcctgcct 
ggccgagtcc 
ttactgtgat 
actgggcact 
gaaagttggt 
atctttcttg 
tcaaggaaga 
atatagctag 
taaactcagg 
tgtttccagg 
ttagcgctgc 
gcagagcacg 
agtagaagca 
cccacttgaa 
gctaagtggt 
gctagggaga 
caacctctat 
cctcacgaat 
atttggtctt 
gctccccttc 
tatggacacc 
gagtcttatc 
ccgactgctt 
ctgagaattg 
ttaagacaag 
tgcctcaagg 
ctcttagctt 
actcccttcc 
gtaaagccat 
agagcagcgg 
cccccccccc 
aataaaacca 
cactcgcaca 
cttgtaacaa 



cacccgcact 
ggagcacggg 
cgctaacacc 
ccacgagccc 
aaatggacct 
cttcaagagt 
ctcctcttaa 
aactctgata 
ttctgaattg 
ctcttcttaa 
gagctcgctt 
gaggccagag 
aggctagccc 
gaagcctcta 
taaataagcc 
agcttcattt 
agtagagcct 
ttggttacct 
atctgcattt 
acaaacttcc 
ttcaccatta 
ggcaggtggg 
ttttctcaag 
taatgcagaa 
ctcctaaggc 
agacaaataa 
acctccaaca 
aattaggggt 
tttttttttt 
gctgtcctgg 
ctgcctctgc 
gtcttgataa 
gactactgag 
acacgactgt 
tggaggctag 
atccaaatcc 
ccactgactc 
gagattttaa 
ggtgggccca 
gatctgctta 
tcggggcacc 
cacggctcag 
gctggcaaga 
gtgatccaga 
acaggagggt 
taacccagga 
ttatacacac 
ctgcaaacat 
gcaggaggaa 
cagttaactc 
tggtcctcta 
atagcaatga 
gggattatgg 
aaccacttta 
atctcattaa 
cgataggatc 
tctacagtga 
ctttaagtga 
ctccccacct 
ctctcagcct 
agccgtagaa 
gtctgggaga 
aacctgtctt 
gtgcaaagga 
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gagagagagg 
aacacaagtc 
attaattatg 
tatttatact 
caatttgacc 
aatggaaggc 
tgccttttac 
tagagcccct 
attgagctct 
gtccaagaga 
aatccctcac 
tttttgagta 
ccacgggttt 
ggattggttc 
tgaccgaatg 
ccatctctaa 
aacgctttgt 
gaatcaagtg 
tactaaacca 
aaagatgcca 
ggagacagaa 
aagctaccca 
cttgtatttc 
acatagggaa 
agggtgcatc 
gtgagggccg 
tgggtggccc 
gtttccaagt 
ccagtatcaa 
acacacagag 
ggctgccaca 
cagcagcagc 
agcaactgaa 
tgcacacatc 
gcagagatgt 
ttaggaacac 
gttttcttcc 
taaacatcaa 
ccttttgtgt 
atctccaggc 
tatttattat 
aagtaggatc 
tcttaaattc 
ggtaactgat 
tgtctgcagg 
caagtaactg 
ttccccaagg 
taatatcaga 
cagatattaa 
caaccatgca 
cacctgggtt 
cactggaaaa 
acttgtgaca 
aggcaggagg 
acacacagcc 
tgggttgcca 
aagccattct 
tcacatgttc 
cttcacgtcc 
tttgtgagtg 
tcaatgagtc 
tcctggagtt 
tcagacctag 
agcagacagg 



ctgagtttct 
aatataaacc 
tagtcaatag 
tttcgaagac 
ctgtagttca 
ataaaacact 
aaagagactg 
ccctcccctc 
aagtgataga 
attcaaagaa 
aggcgttaac 
gggaaatctg 
ctacatgcag 
agcgacctca 
ctgtgagcag 
cctgcaattt 
aagcaaggtc 
taactcaatc 
gtgagcctca 
acagctaagg 
acaggaggat 
cttgaagact 
tagtgtatga 
gatggtgcct 
gatctaaatc 
ttccagagat 
aggggtccag 
attaggacca 
gcttcaggcc 
cactagcatt 
gtgagagtcc 
catttctcca 
acatggggtg 
acaggccagg 
gcagcagatg 
actgagcagg 
tcgatcacgg 
tccaaccccg 
tttcattcat 
ctcagtctaa 
gaaggctaat 
cacaaagata 
tatttaacat 
gcaaactgaa 
ctttgtcgga 
accccactat 
ggacagcaag 
gtcatctgac 
cgataggcct 
gtaactctgc 
ctcccctact 
gcacactcca 
taatttctgt 
aggggactgc 
ctctgagctc 
gcctccccta 
atttctgttc 
ttcctgatgt 
atcacagtgg 
gcagagcctg 
cagcaggaag 
gggccagatc 
catcctcccg 
taactcaccc 



acttctataa 
tttagacatg 
ccatgggttt 
aagccactca 
aaaaaagtca 
ggaaatatgg 
gagggaattc 
agctgcttaa 
tgagaatcag 
ctgtgggcca 
agtggaacca 
ttaaacatcg 
ctgtgctaga 
gaggacattc 
gagatatcaa 
gtcaatctca 
agatagaggc 
cctatctcct 
agcaaagcct 
gccagggatg 
caggacttca 
gtccccaaca 
gacgaaggaa 
caaaaacagg 
cacatacctg 
ttaataaaga 
actgaataaa 
cagcctggcc 
agcggcaggg 
cacctcctgg 
cctaccttcc 
gtaacttgct 
ctgtgacccc 
cctgaagatg 
ccaagaggtg 
catgctcttg 
tctccacgtt 
aggggccaga 
ttaaagatgc 
tctctgagtc 
taattgcttt 
tactgtttat 
agctttaata 
gccatgctct 
tttactaaga 
gtggataatg 
ccagtgctta 
tcagtctatt 
ttgattaata 
aaaacagacc 
cttctccccc 
ggttacataa 
ttgaagataa 
tgccacacta 
actgtctttg 
gaggatcttc 
ttctctcttc 
tcgcgatcag 
ccttaattcg 
gcgtgcagcg 
aaggaaagtt 
ctcttccaca 
gagaagcctc 
taaagcatca 



ataaaccctt 
gggctgccaa 
cattagcgta 
gggaaaaaat 
gaacagcaca 
acagaaatca 
agaaactatt 
cactggggtt 
aacaggaaca 
gaatctactt 
atttccaagg 
tcccacgagg 
tctgctgaag 
ttgttcacta 
aggccggcta 
gacagcaatg 
tccataaaaa 
gagattaggg 
gtctgttctc 
tagtgcaggg 
aggtagtttg 
aaacaaacaa 
gaaagctcaa 
acagccgagg 
gatttaaagt 
cccaccctga 
gggaaaatgg ■ 
cctggcatgt 
cactggacag 
tctcttcttg 
tcaccgttaa 
ttccacagat 
ttggcaccac 
ctgggggact 
ggctgcagca 
ccgaatgaaa 
tcagagttgg 
tcatcggtgt 
attccagggt 
tgtaatgagt 
ccagttacaa 
tcaaacaaag 
aaggtacaca 
gtagcagcct 
ttctgttatc 
aagtaaatta 
tcagccgtcc 
aaacctatca 
attctacctt 
ctttgattcc 
caggaggaac 
tttgcctcat 
caaaatttca 
ccggtggctg 
cttatcagtg 
attgtggagc 
cgttctggcc 
ccgtctgcca 
agattcatcc 
gaagagagaa 
tacaagtctc 
gaccctttcc 
tgtctttcta 
ctttcatcta 



ggcaggcgga 
acttcacttt 
ttaaatacca 

ggtgggggga 

ctagagatta 
gatccctgcc 
taaaataaag 
gtggtggacg 
gtgtttttga 
aggcag.tcct 
cagccctgct 
gagcccagct 
tggccggtga 
gccctcgtgc 
ctggactgaa 
aagactgtga 
ttgttcaggg 
aagggaagga 
agcaaggtga 
tgctgtgata 
ggctataaaa 
gctgggtatg 
gcccgagacc 
agcagacaga 
aatatctggg 
ctgagtatgg 
gaggaagttc 
gctggccagc 
ttcccacaca 
acaacagata 
ggatggtaca 
actgttatag 
aaagccatgg 
gcaatgctgc 
accagagata 
agcctcgcag 
cttggtgtta 
tcctgggctc 
tgcaaacatt 
taacatcttt 
gaatccttta 
caaaggaaac 
ggtccgcctg 
ggatgtccca 
ttcaaacagg 
tgcaatttgg 
tcagaggaga 
aaccctgaag 
gttgccattc 
aggcagacgc 
tcaagacaaa 
tatccagagt 
tgaaatccga 
agaactggag 
agtcccaaga 
tgtcccatgg 
accagtggta 
tagtctctga 
tttagaaaag 
ctttctttgc 
agagagaaag 
ccatcctagg 
tgggtgcagt 
gaggagctct 



tcactaaagg 
tcgacagtat 
cgatcaatat 
ggaggaggaa 
gcaagggttt 
ttcatttttc 
gcaaaatgat 
caaaataagc 
ggcaaaatat 
ctgggacccg 
ggtgatctga 
ctttcactcc 
ggaggtgtgg 
actggggcga 
aactagatca 
ttttctagtc 
ttcaggcaga 
aggctgtgtc 
gccacccacc 
tcaacagctg 
tataagcttg 
gtggtcgatg 
tgcttgggtt 
cagggcagac 
agactggtct 
gcaacaccca 
agctggtagt 
tagtctagct 
cgacacacac 
aaatgtaact 
aactgtgagc 
cactaagaaa 
caagctgaag 
ctggattctg 
attaatatga 
tgtaatgact 
ggctgccgcg 
aatcgccttt 
agtgagaatc 
ccctagtgaa 
cagtcaaaga 
aaagcttctt 
gcaaccgaac 
gtgccacctc 
gattgtgtct 
gggtttgctt 
caattctgat 
gaaggatatt 
taagcattaa 
accctctgaa 
aaggtgccac 
ggggttaatg 
caaagccgga 
cggaaggttc 
ggggcccaga 
ggcgggaagg 
cttgctccca 
agtccacggg 
agagaagctg 
ttcagtggct 
tgctgtgact 
tgccctgtgc 
gggggcccag 
gtggtagtag 
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ggactgaggc 
tccatcccag 
ccagcctacc 
ggtgataacg 
cacttgtctc 
cacaagtcta 
ccgctgttta 
ctccatggct 
aagcttgcaa 
ctagatagat 
ctggacatca 
tttgtttaat 
annnnnnnnn 
nnnnnnnnnn 
ctcctgctgg 
cacagtgtca 
gaactcgagt 
gattccctgg 
tgtaccgatg 
gagaacagca 
gagggggggt 
aagttccaca 
agaaacagat 
ttttctcctt 
tatatctaag 
cattatggat 
agaacagtca 
tgagttaggt 
caatgatact 
ttgctggtct 
gttatgtacc 
gtttctctgg 
ctcagaaatc 
cccagtttaa 
ctggatgttc 
cctctgcctc 
ttttttaatg 
cccagagcgg 
ttttaaaggc 
cacacccgta 
tgctggtctg 
cagtttgagc 
gcagagaagc 
aacaggaaat 
cttctctact 
aaattaaact 
gcctggttct 
gcctcactag 
tggcctgttc 
cagaaccaaa 
ctcttcagca 
acgtgcggtt 
tggagctcag 
tgcggagggg 
ctgcacccta 
cactgcgagg 
aatggcccag 
cccttcctca 
tgatgcccgg 
gcgttggctg 
gggtgaagca 
gactcgagtg 
cgtgcccttt 
ggctgcccag 



ttctgctcca 
gaagacagaa 
tatgttcatt 
cagaatgcac 
cttgaactaa 
gctgggggcc 
caagtatggt 
ttaggttcag 
caacagctcc 
ggggcaggac 
ggattagttt 
attctgagtt 
nnnnnnnnnn 
nnnnnnnnnn 
ggttacaggc 
agctggatat 
tggtgaaata 
tcacatgctt 
catgcttgca 
agtgaacaaa 
gtaaagctct 
tgtaaggtca 
gcagtctgtc 
cctttctttt 
tacactgtag 
ggttgtgagc 
gtgctcttaa 
tttgtgtagc 
ctgcctgatc 
ggaactaaca 
accacacctg 
gtagtcctgg 
cgcctgcctc 
gggttttttt 
tggaactcac 
ccaagtgctg 
tgtgtatttg 
aagtgttctt 
accaaactct 
cacacccgta 
gcctgtgtat 
acaggctctc 
atctgcaagg 
ctcaggagag 
tcccagaaca 
ccttggtttg 
gctgctggca 
atccgcgtct 
aggatgatct 
cgagcctcgg 
ggggtgggga 
gtttccgtgt 
cgctggtgct 
taagaaaacc 
ccccccgctc 
agggccgtgg 
gggagcagaa 
ggacctgaca 
cagctggccc 
gctagtctcc 
gaagtcagga 
gaggtgcatg 
gaggaggggt 
aattaccgga 



gctctgggca 
aatccctaca 
taatacaaca 
aaactctgcg 
acagtgtctt 
ttcatgcttc 
taggagcccg 
agtcattctc 
aacacagaaa 
ggactgtcca 
aaacacttgt 
ttagaaccta 
nnnnnnnnnn 
nnnnnnnnnn 
ttgagccacc 
gatggcacat 
gtccaccagg 
taaggaaaga 
gccacccaca 
caaacaagcc 
tggcaggtgg 
tgtgatcgag 
atatctaaac 
ttctctccct 
ctgtcttcaa 
catcatgtgg 
ccgctgagcc 
cctgggtggc 
tgtcttaatc 
gagatctgcc 
gagtttaagg 
ctgtcctgga 
tgcctcccaa 
ttgtttgttt 
tctgtagagc 
ggattaaagg 
tatgggtgtg 
aaccgctgaa 
taggtaggag 
cacaccacat 
gagctggaac 
tggtttctgt 
ctgtgaccac 
tatttccttt 
ttgtgtgcca 
cagagtagca 
gagctgacac 
cagagtctgc 
tcctctgcat 
acgaagcccc 
caagagcaag 
ctgggagatc 
tcgcgctccc 
cccaggcttt 
ctcgttcatc 
ccaggctcag 
ggcggaaagt 
tggaggagct 
tggcgccggg 
gggagcgccg 
ctttactgca 
ggctgcggaa 
cctgtagccc 
acctgtggcg 



tctgctcttc 
atctacccga 
ctattcacag 
agacggcaga 
cagggtgaca 
gtaattaatt 
ttggcctctc 
tgcctgtctc 
ggtcgacagt 
ttcatcatgg 
acttttacca 
aacaagagtc 
nnnnnnnnnn 
ngccttgaac 
tagcaatagc 
ctaacctcca 
tgctgcccaa 
gatgtcctct 
agtgcactgt 
ggattgtgac 
gtaacactcc 
cctccaacag 
gtatctctgg 
aaaaattatt 
agagggtgtc 
ttgaactcag 
cccccaaccc 
ctacactggc 
agggactcac 
ttgcaaatgc 
tgtttgtttt 
tagaccaggc 
taaaggcgtg 
cagggtttct 
tgaactcaga 
cactgcccag 
tggaagccag 
tccctcttcc 
actatacaca cacacacaca 



aggttacttc 
ctctcccttg 
actaaaatat 
gctgcagggg 
tgaggcagaa 
catgtgctta 
actgcctggg 
tgcatgcctc 
gtgctgtgag 
gtaagcaggg 
caggtggggc 
aactgtggga 
nnnnnnnnnn 
nnnnnnnnnn 
atgacagctt 
gcttgcaata 
taaaagagct 
acttgccgaa 
cacccacaca 

gggggggggg 

ctggctcctg 
tacatctggg 
cattgttgtc 
ttctctttta 
acacaccaga 
ttgctgggat 
atctctccat 
gttaccttaa 
attttgagat 
tgtttctgcc 
gttttttgtt 
actcgctctg 
gtgctgggat 
tttcctgaga 
aggttagcct 
cgtgtgccat 
tgggtgcttg 
ccatctctct 



gaccatgcct 
caaaaccttt 
tctctgtcct 
gctggctggt 
taagaacgcc 
agtggcaagt 
tgggagcatt 
ttggctaaag 
aggagaaatc 
ttaagggcgg 
ctaaaggcag 
aggcgggatc 
acatgacccg 
cgccctgctg 
ctttcctttg 
ccagtcttcc 
ccttgcgctg 
ggttcttaca 
gctccggagc 
gggcactcat 
actgggaccc 
acgttcggta 
acggagactg 
tgaaactctc 
ccatgcatac 



gagcacacaa 
gtcgggagat 
gtgtcgcatc 
gctctgccat 
agacttttgt 
tattaaccaa 
gagagggtgt 
ggctggcatt 
agagagggga 
ctggtttgcc 
taggagagac 
tcgcccggcc 
catcagctga 
cgccccggag 
tcgctggttc 
cggcctggca 
cccccaggcg 
gcagggtccg 
gtggagagag 
gtagtggccc 
tgtccccggg 
cagaggctgc 
tcctacccgc 
actcggttca 
cacacttatg 



accattcctg 
ctttctgaca 
gcactaagct 
gttcctcctc 
cctagggaca 
actacatgca 
gcctctgcca 
tccgttggta 
ggatagatgg 
ctatgcagct 
gcacgtgcta 
cacacataac 
nnnnnnnnnn 
ttgcttctgc 
tttgtaaatc 
aagattccct 
gcctgaatct 
gagtgccacg 
ctcacacagt 
cagaataatt 
ataagtgggg 
tccttgnnga 
gtagtctttc 
tatttattat 
tgtcagatct 
gaccttcaga 
ctttctcttt 
tttgaacttg 
tacatagcct 
tgggaataaa 
tcgagacagg 
tggccttgaa 
tgccaccacg 
ctgtgtagct 
aatctgactg 
tgattttttt 
gtgtcagatc 
ctaactctga 
cacacacaca 
gtggttttat 
ccgcagtctg 
ttgactagag 
ctacattttc 
gcctgggcca 
gtgctttgga 
atgcctaaag 
tctgagatga 
gaaggtccag 
cacgtagccc 
tgagccttgg 
ctttagagac 
cccgtcacgg 
cgcaggaccc 
gcgcagtcac 
ccccggaagc 
gccaggacca 
agggctggtc 
atctgaacat 
tagtgtccac 
ctgagggcct 
ccccaggctg 
tgggtggagg 
tgcaggaggt 
gacagcctta 
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cagccacagc 
gagggtgtat 
cagggatggg 
ggctttgctg 
gcatgatgta 
tcttctcttc 
ggctctgtct 
cctgagcgct 
tgggacaggc 
gaaagaactt 
cctcatgcag 
gctgccctgg 
caagttccga 
ggcatttgtt 
tgacgtgctc 
gctctgtgga 
gatgcagacc 
ttgctctatc 
ggaatttgtg 
tcactggatt 
gaatgtgtgt 
gctatttgat 
tccactcatc 
ttcaggacaa 
tggctggact 
tctggattcc 
agcatcacct 
caaagctgcc 
cttcagtccc 
cctagggagc 
gagcctcttc 
caccagggtc 
gggtctctgc 
actcctacag 
actgtttgag 
cagccccttc 
tcttttatat 
tctgtggcag 
cctgcctttc 
cctatttgaa 
gcctctcatc 
cgactgtttt 
gctgccccat 
cctggtcggg 
tgcctttggg 
cccagactac 
cttccaagct 
cctgggccag 
tggggatgat 
ggacttgaag 
ggaggaggac 
gctgtccatg 
gccgctgaca 
ctgagggggc 
accagatcca 
ctgtgtgtga 
aaagccatac 
tactcgaggg 
aagtggtggt 
aagagtgttc 
gagaggcagg 
ggtgcaggcc 
gccagcctgg 
tcgaaaaaaa 



actgccccct 
ggatgcacct 
ccctgcccct 
gagtcgcccg 
gttaccttca 
cgtgttctga 
ttgcaccaca 
tacgagatgc 
attaagtctg 
cggggccttg 
ctgaatcggt 
gtggtggact 
ctcaacaagg 
gcaggtggtg 
tctgacatca 
catgtccgag 
tggacaccgg 
caccctgaca 
gctgcccatc 
gatcttacct 
ctgcacctgg 
cagccacacc 
ccccggctgt 
cttataaatg 
agagataggc 
atctccctcc 
ggcctgttgt 
gggttggacc 
atacaggcct 
cagttggagg 
catcgagaca 
cggatactgc 
atacgccact 
ctgagcggac 
tataggccgg 
agctccgtgg 
caggcccggc 
cagctgggtg 
gtgctgtcgc 
cccgttgcca 
ggtgcctatg 
gtggcccagt 
gtcctccagg 
accactgagg 
gaagagattc 
aggtcgggcg 
ggactctacg 
ctgagtgata 
ggcggtgccc 
cagagcgaag 
caggaggatg 
gagacggtgg 
gagcagacag 
atgggtcagg 
ctgtagcacg 
gtggaaagtg 
cttgggctga 
ccctgggtgt 
tgagatgtga 
cagacaggga 
gaaaggctaa 
tttaatccca 
tctacaaagt 
aaaaaaaaaa 



cagctctacc 
tcttgccagt 
ctcggggcag 
agatgctcta 
gccctgccaa 
gggccatgga 
ttgctgtaga 
cttccgagga 
aaaaagaggg 
tgctagactg 
tggcaggtcg 
ttaccacacc 
gagataagca 
caggaagtgg 
cgtactatgt 
cgcagtggga 
atgagtgcat 
tgcccgacct 
gagccctcct 
ttggctacaa 
tggacgctca 
cccaacgcct 
tggtccagcc 
gtgcgggcag 
ctgggacagg 
ccgggaaagc 
ctttttctgc 
ctggggaggg 
tggaagagct 
agcctgaaaa 
tgcaggtcct 
agcctgatgc 
ccaaggacat 
ccaaaagtcc 
tttcccaggg 
tccccttccc 
gtgtggagga 
cggtgttaaa 
tcatgtctga 
aggccctggg 
agagcccctg 
tggtggtgcg 
tactggctgg 
atgaggaaag 
agatggatgg 
tcagcttcca 
tggctgaatc 
agagcagtac 
ctgcggacaa 
gctctgagga 
aagtcacggg 
tggctcctgg 
aaggcaaaga 
tgcttttcct 
cacagccaca 
gggcttagat 
ggctgggagt 
cttatctcta 
atggagccag 
acagtgtgtg 
aatggttttc 
gcactcagga 
gagttccagg 
aaagagtgta 



tgccctagac 

gggtgaatcc 

ccctgcctgc 

tgtggtacac 

gctgaccaac 

tgcctgtcac 

cgagaagcta 

tgaaaaccag 

ggaagggaga 

ggtccatggc 

acggcagggg 

ttatgggcgc 

attggacttc 

ggagccaccc 

atacaaggcc 

accccacgag 

acccgagttc 

ggatgtgccg 

ggagagctgg 

actccagggc 

cacccatctg 

ggctggatct 

tattcgggag 

gcttgtcgta 

agaagatgat 

aggtgaccag 

accctcgggg 

tgaagagggc 

ggagaaagtg' 

gcctcacgcc 

gggtgtcctg 

acctttgtgg 

ccccgtgtct 

catggtgtcg 

attaccccca 

tccatacttc 

tgaggtccag 

tgacatcact 

ggagcacacg 

ccccaaaaat 

ccgcctgcat 

gctgggcttg 

ggtggaggct 

tgagctcccg 

gcagccggct 

tgaccaggcc 

tccacagccc 

cagcgaagcc 

gaacagcgtc 

agaggaggag 

aacatccgag 

tgatgggaga 

acaaaagatc 

tcaggctctc 

gtcagacaca 

ttagctttca 

ggagttgagt 

gccccagaag 

aactggccgg 

tgtacctctg 

ttaagagagt 

ggcagaggca 

acagccaggg 

gaagggtgga 



tctatacgac 
atcccatgtc 
cccagccttt 
ccttatgtgc 
agccaagcca 
cgccaggggc 
tgcagtgagc 
gagggctctg 
actgagtgtc 
cgaatc.agca 
gatcccaact 
ttccgagacc 
acctatgaga 
catgttcctc 
cgtcgcacac 
tatcctgcca 
tacacggacc 
gcctggtgca 
gaggtgtccc 
aaagaagctg 
accagctatg 
cctgccctgg 
gccacaggcc 
gaggccactc 
ttagaacagg 
ccaggctctt 
tctcgaccag 
aagattgtcc 
ggtaacttcc 
cagccacctg 
ttggctgaga 
gtacgctttg 
ctgcagcctg 
aagaagggca 
cccagcccag 
ccagcactgc 
ggtcgggagc 
cccgagggct 
gctgtgtaca 
gccaacaagt 
ggccgcttct 
caggccttcc 
tcccaggagg 
gtgtccgggc 
gcttcctcag 
gacctgccgg 
caggaggctg 
tcccagggcg 
aagtcagggg 
gaggaaggct 
ctcactctgt 
gacagagaag 
ctccttggtg 
atatgctggg 
gtgcatggaa 
ggagacagaa 
ggtagagcac 
aagtattaag 
aacagtcggg 
aggctctcat 
ccagaagggc 
ggcggatttt 
ctatacagag 
agccagggac 



aagctctcca 
tatcaaatgt 
tgcgagctga 
aattctccct 
aggtgctctt 
tggcctgtgg 
tccggctgga 
aagagaaaaa 
ccacctgcca 
acttccacta 
atcacccagt 
ttcgtaaatc 
tgacccggca 
accacatctc 
cgcgctcggt 
ccatggagcg 
cctctatctt 
gttctaacca 
aagacctgca 
tgaaggagaa 
gcgtggtaca 
cccctgaacc 
aggaggacat 
catgtgagac 
ctacagaagc 
cctccagtca 
gccgtaggag 
ttccagaggg 
tggccaaagg 
tgcacctgca 
tggtgtttgc 
aggctgttcg 
tgctagacac 
agctagaccc 
cccagctcct 
acaagttcat 
tggcgtttgc 
tagagatcct 
cagcctggta 
acctcctgaa 
acctgtacac 
tcacccacct 
agggcaaagg 
ctggctcctg 
gactggggct 
acacggagga 
aggccgtgag 
aggagagggg 
acagcagcca 
gtgtggtgtt 
ctgacacgat 
aggaagagga 
agcccgtggg 
tgtgggtcca 
tgtggaagtg 
agctccttta 
ttgtctggta 
aaataaaagc 
tggaagtggg 
ggttccatca 
tgggcttggt 
tgagttcgag 
aaaccctgtc 
aagtctgtac 
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aagaaggaac 
ccagtttctg 
ttgctagcag 
ttatctgatg 
tggctgtctg 
cgcctgctga 
ccagccaggc 
cactccctga 
aagagcccag 
ctgccctcca 
gttcaccgtc 
gccagtccta 
cctgtatgga 
tccctggttc 
gtcttggggg 
tctaggtagc 
ggctgctggc 
tcatggacat 
cctccttcgt 
ccaggaaggg 
ttgggggtag 
tctgagacta 
tctgacttaa 
cactttccca 
tgggggctgg 
ggttagattc 
ccaccctcct 
aaaaaacacc 
ctctgactaa 
agccttacat 
aagggctggg 
gtcctatgcg 
atggtccagc 
catgagcttc 
ggccaggcca 
ctggatccta 
cgacgaccag 
gcgtacacaa 
cacagagttg 
tcagcaatgg 
ctttaactcc 
ttgctcaggc 
tgtgagtgtg 
catctctcta 
ttttgtagac 
gctggccttg 
gtgtaccact 
cctcccctgt 
cgtaggaatt 
ttatctatct 
acttggggag 
gcaggaactc 
tggcttgctc 
tatgcctggc 
cccctgcctc 
gggtggcact 
cctccatagg 
ctgtagctga 
accaaacaaa 
cagcaggaaa 
gatggtctcc 
ttatgggatg 
gaatattttg 
tcattgtgtg 



ttgggagcat 
gggaccttga 
agccacagcc 
agctcaggac 
ccaagcttgg 
catcttgtta 
ttctgcccat 
gcagggcctg 
tattgaatga 
ctggcttctc 
agcagtgatg 
ggtgacatcg 
gaacccgttc 
gtcaaacccc 
ttccaggagg 
cccagggagc 
agcggtgaca 
tctgccccgc 
cacagggtag 
ctaggaagct 
tggcacaggt 
gcctggtcta 
gaaaagagtc 
gcatgcagga 
aaagaatagc 
ccagctgcca 
tcggtctctg 
catacacata 
gctacactgc 
gcagctcctg 
aactcttgca 
tgaaaaccat 
agcacctgag 
ggcagcaggt 
gggcagtgga 
agggctgtac 
tggaccccac 
tctacgtacc 
gtgactacat 
aatcttttta 
agctccttcc 
ttcttncatg 
gaggtgacag 
gatccagttt 
agggtctctc 
agctcacaga 
cctgcctggc 
gctgggatta 
ttaggatcag 
tagttaggat 
gaaaaggttt 
gaggcagaag 
agtctggtgt 
tgtcctagaa 
tgcatctcaa 
acccactgta 
ccaatctggt 
tgccaaattg 
ccaaacaact 
ggctcaggag 
agaagctacc 
ggcgctgtac 
ctcctacaga 
tctgatgcta 



tgccgaaagg 
ccatggctag 
gagttggctg 
cttttcctgc 
ccccacagta 
tgttggtaag 
ccttagccct 
ggtctcaccc 
atagaagcca 
aacgctgctg 
acacccctcc 
tgtcggggcc 
tcacctacca 
ggcttggggg 
tctgtggggt 
aactcaaacc 
ctgacgcaga 
attagccacg 
gcccctgctg 
cagggagaag 
ctttcattcc 
cagagagagc 
gtggttcatt 
ggagctatgc 
tcagggttta 
catggtagct 
tgggcactgc 
aaattagacc 
ttccctgtgc 
ttatttgaag 
ctttgattct 
cagtctcatc 
tgagccagtg 
aggcaggcag 
cccactgaat 
tgagggccag 
cctgctggaa 
tttctcctgc 
ctcttccctg 
tttttatttt 
agcttccacc 
ttcttatttt 
aataacagtt 
ttggttttgt 
tgtgtagcct 
gattcacctg 
tttgtttttg 
caggtgtgtg 
agtgaccaga 
ctctgttgaa 
attttgtctg 
ctgaagcaaa 
accccctccc 
ctcactctgt 
gtgctaagat 
aattggtccc 
gggggaattt 
ataaaacaaa 
aaccaagaca 
ctggccattc 
tgcaaaggaa 
tgggcttctc 
attgtaagtt 
gaatggtgcc 



atgacctctc 
gtgaatggac 
ggtggggtgg 
cctgcagata 
gcctctcgcc 
gtctgtggtt 
ctctaggcga 
attaagctgg 
ccccacagtc 
cccttccttg 
actgaatgcc 
tgtgctcagc 
gtacctgccc 
tgggggcaag 

gacctgtccc 
ccagccgact 
aaatcatcgt 
aggtcttgct 
cttgggagag 
cagataccgg 
agcacccaga 
tccaggctat 
ctgggttgtg 
ttgagttcca 
agagcactgg 
cataaccatc 
aaacatgtgc 
aaaaaagttc 
ctcagtttcc 
ttcctggtaa 
aggttcccca 
gccctcatct 
gccaccttct 
cttctgggct 
ctgtggtctt 
ctgccagagg 
gagctgcaga 
ctgttgggta 
gggtgggccc 
gacatggggt 
gtctcctgtt 
taaatgacct 
ggggggtcag 
gggcttttat 
ggccatcctg 
cctctgcctc 
ttaaccacca 
ctatcacacc 
tttggtccta 
aatcatggtg 
acaactctca 
agccatggaa 
cacccacctc 
agactaggct 
taaaggcggg 
ttcccatatc 
tctcagttga 
tctcaaacca 
gtgacttata 
aagtctgggg 
tgaacagctt 
tctgagtagg 
cccagaggca 
tggcatacac 



tgcaggtcct 
ccaggatggg 
ggtggggtgg 
cagcctgcaa 
atgtggcccg 
agtgctggag 
ctccttccct 
gttttcttgg 
tcagaaggqc 
gtagggccca 
ggcaacatct 
tgcctcctcc 
tacatcagct 
gatccaagga 
tccctcatct 
gaacagccgc 
atacctctct 
gcctgtgctt 
ccacctggct 
cctgagtcat 
ggagggcaag 
ctaaggctcc 
ggtgtggctt 
gcccttcaga 
ttgctcttcc 
cggcagttct 
acagacatac 
atgttctctc 
tcccctggtc 
attggtcaag 
gtggggccca 
gcttgcgcat 
tccaagtctt 
gggtgggcca 
cctacccgca 
cgaccttctc 
aggtgttcac 
ttgcccatca 
cgatgctttc 
ctcatttagc 
ggcattgtag 
gtgtgtgtgt 
cagatgcctt 
gtgtgtgttt 
gaactcattc 
ccagtgctgg 
tcctcctgcc 
cagctaacag 
gggcccaatt 
gaacatcatt 
ggtcaccaag 
gaactctggc 
cccacaatgg 
ggcctcaaac 
tgccatcacc 
agttgttaaa 
gggtttctct 
ccaccaccaa 
aagagaatct 
aacagaatgt 
gctgggtttt 
atgggccacc 
ggacacatct 
gtgtgtgtct 



gcccgaggag 
atggtcaggc 
gaagggtgag 
gatggtccgc 
gaacctgctg 
accaggttcc 
aacttcccag 
gtaagtgggg 
ggcttccctc 
ctcgacagca 
accagaagag 
acattgccta 
acctggtcag 
ccagccccag 
attctgtggt 
aaggaggccg 
gacacgaccc 
ggcttcctca 
gagggggccc 
ggttctgatg 
tttctgtgag 
atagtaagac 
ggtgatggga 
aaaacaaaaa 
agaggatcca 
atggaacctg 
atgcaggcag 
ctacctgtag 
tggactgatc 
tccttcaggg 
ggcccggact 
cgggcaggag 
ctctcatctg 
ggccaggcca 
ggatctgcca 
tgatgggcag 
cctggaaatg 
cgttcctttg 
acctccagag 
ccaggctgac 
tcatgtggca 
gtgatatctg 
gcctgatgag 
gtttgtctgt 
agtagaccaa 
gattaaaggc 
tcagcatctg 
tggatttaaa 
tccacagtga 
accaaatgca 
ggaagtcagg 
ttgttcctca 
tttctctgct 
tcaagagatc 
cctgccccag 
taagaaaact 
tctcaaatga 
caacaataaa 
gaacattttc 
aggggaatat 
gtggcttccc 
ctgtagttgg 
gtcttattct 
ctatagagac 



205860 
205920 
205980 
206040 
206100 
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207480 
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207840 
207900 
207960 
208020 
208080 
208140 
208200 
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agcactcatg 
gacgagaact 
ctaagctctg 
ccctgtaggt 
agggctctat 
catggctagt 
ccactcaggg 
gccccagagt 
caacaggaat 
cgggaactgg 
ccacttccac 
ggccgccctg 
ctggccgctg 
ccagcaccgc 
ctgtgatggg 
tgttcgacgg 
acgtacaatg 
ggacttagaa 
tcagttgctg 
agtatggctt 
tggaggaggg 
cagactgttc 
tgagattgtg 
tttgaagtag 
aagacccttc 
atgcctgccc 
gactgcagga 
tctaggtacc 
cagccctgtg 
ggagggtcag 
gcagtgatgg 
tgggtggagg 
gtgttgtggc 
ttctacgagg 
tgaggtccta 
ggtgtccctt 
agctcaggaa 
cctgatgctc 
ggatcagggg 
ttactgcctg 
gcaggaacag 
cagagctcag 
gcagtgggaa 
acagggaagg 
gtctctaaat 
tagtcccagc 
atgccaggtg 
cagggggttc 
acacacacga 
catgttagta 
tttatttttt 
atcgcaacac 
acagagcaag 
acaaaatatt 
ggctgtcctg 
tgcttctgcc 
ttatagattt 
aggggtcaga 
ccatgctggg 
ctagagtttt 
tttcgagaca 
gctggcctcg 
gtgcgccacc 
ttgttttgtt 



tctacgtatc 
taataaccca 
ggtagatagc 
gacatcatcc 
ctggaaagca 
gccggccctg 
acctttggga 
cctgggccac 
gaagacaacg 
ctggcgtact 
cagatccgcc 
agcagtgaag 
tacaactatg 
aaaagcgtct 
gcagtgcacg 
ctgctttact 
gcgtattttg 
tctgtgggac 
caggcgtctg 
tggcttcttg 
tcagggcaga 
tggggctgga 
ttttctgagg 
ccagggctct 
gcacagtgga 
cacacaccag 
agccaggctt 
agtcaggaaa 
tgggtggggc 
gcctactggt 
atggcgacgg 
gctgaaccct 
tggcttctcc 
ctggccagcc 
tcctttcatt 
ataactactc 
ccagctgcac 
cacagcgatg 
tcacgccctg 
gagaccactg 
gcgcaggcct 
caggttctgg 
ctctcctccc 
caggctgccc 
cccaacattc 
tctcggagtc 
ttctttgaga 
aacacccttt 
acacaaataa 
cgattggatg 
ggtcttaaaa 
tcaggagcag 
tttcagggca 
taaaagattt 
gaactcactg 
tcctaagtgc 
ggtgttttgc 
ggcgagtatc 
actgggtcat 
tttcccccct 
gggtttctct 
aactcagaaa 
acgcctggct 
tttttctttt 



gataaaggaa 
gagtagccca 
taccttgcca 
ggaaaatcat 
tgagcccgag 
aatgggaccc 
gtgtcctggt 
tgggctccct 
ccctgaagcg 
ggcagtacga 
tgcagagctt 
acttctttct 
gggacgggac 
tctacgtggg 
tctgggaccc 
gtgccttagc 
acggggaaga 
ccaagtttga 
ccttctacct 
tacgacgtca 
cggaggtcgc 
cccttgactt 
gtcactgggg 
ttgatagcca 
tccttcagac 
catcaccatg 
gcaggtcagg 
gacactcagg 
gctggcacgg 
ttcgtgggcc 
gcacttgaac 
gggcttgttc 
tcgggcttca 
catgaagggg 
tctacttagg 
ggggacattc 
aggaggcagg 
ttctaatgag 
tccgtgaccc 
agcagagtcc 
gcgcatcgtg 
gaggggagaa 
ccgccgtcca 
ccagctctat 
taagaaagtg 
ttctctgagc 
ggacctgagt 
atggcttctg 
aaataaatgt 
tggcagtgcc 
ttttaaaaga 
aggcaggtgg 
gccaaggtta 
acttacttct 
tatagaccag 
tgagattaaa 
ctgcttgtgt 
agctcttcct 
ttgcaagagt 
ggctgtcctg 
acataggcct 
tccgcctgcc 
cagaatctgt 
ctttttttgt 



gctgttttgg 
gtcagtacct 
tcttccctga 
ccccaaccat 
ctctcgaaac 
tcagagtggg 
tggaaatcgc 
ctctggagtg 
ggagctgcct 
gatcggtgtg 
cccagggcac 
gagtggcagc 
caatgagacg 
ccagcttgag 
cttcacaggt 
caggcctctg 
ttcagtgagg 
atccactccc 
gtaagaacca 
cctgtcgtcg 
agctagttag 
cactgtggaa 
aatgggatgc 
ctaagtcccc 
agccgggtgc 
gccagctccg 
aggggtgcag 
ggactccacc 
atggggcttt 
tgaatggggt 
acaatctcct 
gctcgttggc 
tggtgctcct 
acattctaca 
gcctggtctg 
agtggggtgg 
ctaggggcag 
taacccttgt 
agttcaggtt 
atgccccctg 
ttcctgcctc 
atagggctgc 
gatggaggtg 
ccaggacccc 
tcaggatggc 
accagttctt 
ttggttctca 
tggacacata 
taaaagaaga 
tcacagaatc 
ttggtttagg 
atctctatgg 
cacagagaaa 
tatttgagct 
gctggcacct 
gtagtgttat 
atatatgcac 
ggaactagag 
tacaagtact 
aagtagaatc 
ggctgtcctg 
tctggctctc 
ttttaaatga 
ttttgttttt 



ggggaggaaa 
tgccttggct 
tcttagaact 
gagttggtcg 
ccagccagca 
agctgtctcc 
atccagatcc 
ggtagtagcg 
cggagtgcqc 
agccagcagg 
acgggggccg 
aaggaccgga 
gcttcccgcc 
gccccgcagt 
gagcgggccc 
ggaacgggac 
caggaagaga 
ccaacttacc 
aaaatttaga 
ttgtaaagag 
agcatgctat 
gcagcaagat 
aggtgtgggg 
agatgtgtcc 
ccctgacggc 
actccactct 
ttcctgggct 
aggaacgctg 
tctcttccgg 
gagctgcagt 
cctatagcat 
cgtcagcccc 
agatacccgc 
gatcaaggtg 
ggagaggaca 
gaaaatggcc 
gaatcagggc 
ccatatttgt 
aaataaagcc 
ctgggctgtc 
ctatattcaa 
ttgtgggagg 
aaagacaggc 
aggggaccct 
tctggggtca 
ttcctatggg 
gcactgtcta 
ttcttatgtg 
cagcggcacc 
tgtgggactt 
agtggtggta 
gtttgaggcc 
ctctttctca 
aggatctccc 
taaactcacg 
accatgcccc 
taccttcatg 
ttatggaagg 
tctgagccat 
tgttcttgtt 
gaactcactc 
agaatactgg 
gagtaatagt 
tggctcattt 



caggcttaca 
tctgttgttt 
ttccccactc 
gggagctggc 
tggaacccac 
aggacgatgg 
ctgactctca 
gaggcctcag 
atgggctgag 
atgcccactt 
tcaaatgcgt 
ctgtgcgcct 
tcatctatgc 
atgtggtgag 
aggtgaggcc 
ctagtgcgaa 
agaagagtca 
agcaatcggc 
agattccacg 
aagtatcgag 
gtgaagagag 
gagaaagccc 
tgagttggag 
tttttcagga 
tgtggctgtc 
gcgctttgtg 
actgggggtc 
cagtgacagg 
ggatggagtg 
agggtgggtg 
gagttccgac 
agtggccgga 
acgggcctgg 
actgactgcc 
ggtttatgct 
ctcgtaggcc 
tagaactgac 
cttgcttgga 
aggaggctgt 
ctgatggggg 
tcatagacct 
atttctccct 
actgttgctt 
gggtctcagt 
tcctgggtgt 
aagtaaggac 
gctctggctc 
gcctgcacgc 
ttgtacctca 
tatttattta 
cacaccatta 
agcctggtct 
aaaaataaaa 
tattagccct 
aagatcctcc 
actattttct 
cagtgctaat 
ttgggagtca 
ctccagcccc 
ttgttttgtt 
tgtagaccag 
aattaaaggt 
tacaggtttt 
gttttatttg 



209700 
209760 
209820 
209880 
209940 
210000 
210060 
210120 
210180 
210240 
210300 
210360 
210420 
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210540 
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210780 
210840 
210900 
210960 
211020 
211080 
211140 
211200 
211260 
211320 
211380 
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211800 
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211920 
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212040 
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212220 
212280 
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212520 
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ttttgagaca ggtctcactc tgaaccccta ggtggcctgg agcttgctat gtagaacaca 213540 

ctgactttaa acttgttttc tgagtgctgg atttatgggc ttgtgctatt ttgcccagcc 213600 

tctgatggtt gttaataaca atattattta gcttttcttt tggagatagg ctctcactgt 213660 

atatcaccca gacagtggct ggtctggaaa tcactgtgta ggccaggctg accttgaatt 213720 

cacagagatc tgcctcccga gttcagaaat taaaagcact ctgggatggt tttggagttt 213780 

ggtgagtacc caagcctcca ttgatgctat ctgtccctcc cgctctctgc aggctgtaga 213840 

gggcagcgtg ctcatcagct cctcttccga ccattccttg actgtttgga aggagctgga 213900 

acagaagccc acgcaccact acaagtcagc gtccgaccca atccacacct ttgacctgta 213960 

cggcagcgag gtggtcaccg gcactgtagc caacaagatt ggtgtctgtt ccctgcttga 214020 

gccaccctct caggccacca caaagctcag ttccgagaac ttccgtggca cgctcactag 214080 

tctggctttg ctgcccacga aacgccacct cctgctgggc tcggacaatg gcatcatccg 214140 

cctcctggca tagggccagc caggagttgg ctgagggcag ggcgagatga catctctcag 214200 

ggcccgctcc tcattcttga tctcgaagcc gattcttcta ggcaagcccc aggctctggc 214260 

tacccacatg gcctgctgtc tgggattgca cagctcctga atctccaaag ccttgaagtg 214320 

gcttcatgaa actcgggaga tactgttcct aaccagcaag aattggggca aggaaagcac 214380 

tgtgatcccc attgctcccc agttctgcct tctggattca catggggaca gggcagctcc 214440 

aggaaatgaa aggagttggg cctttgctca gccagcttcc tctagccacg ctctccttag 214500 

ctctgtttct cccttgggta ggaaactgct cctgtctagg gttctgatgg tactgggact 214560 

ccaggctcag gagggctggc caggacctac gactttcagg gcttggtctg gggttttagc 214620 

attcattcag ccaggtcttc agtatgggac cagaaaaaag gggatgtgag aacagggcta 214680 

gggaaggggt tatatgggcc cagctggtcc aggaatgaat ccatgccttg ccttggtacc 214740 

cctaaccaca gcgtttgtgc cttcagccgg ggaggcagcc cttgggacca gcatccctag 214800 

ggacaggagg cagcgggaat catctctgta tctcgggttc tgcccagggg atgggcagac 214860 

tctgccatct cttgagtgtt cgtttggaga agcctgagat gtggcccctg ctgccttctc 214920 

actagttgca gtctatgtaa ataaggtcaa taaattcttt ggaagagcca cggagctgag 214980 

tgaggctgtg ttgtgttttg ctttgcctag gctgggctca ggcagctctg cctcagcctc 215040 

ccaaggagct ggggaactgg tatatgtcac tgtatatgtc ■ actgtgcctg gcttatggct 215100 

tggcttggct ttttttcaga tggtctcaag tgcctcaggt tggccttgat cttgggatga 215160 

ccttcctgct tgaaacagag tagtgggctt ataggcatga cccaccaggt ccaattttta 215220 

ttttttaaag gcattgattt ttatacgtgt atggttgttt tgcccacttg tacatatgca 215280 

caccatactt gtgtctggtc cctgcggagg tcagaagagg gcatcgggat cacctggaac 215340 

cgaagttaat gaatggttat gagccacatc tcgatgctga agattgaacc tggatccttt 215400 

gcaagagcag ccagtgttct tacccactga gccatctcta agccccacac ccagcttctt 215460 

ttgatacaag gtctggtagc tcaaacttga tatgcagccg aggaggttga cctggtattc 215520 

cctacctacc ctcttctctc taccttccaa gtgctgatat tatacatagg catggatagt 215580 

catgcccacc agtttgcctt gatggcacca gagtcaggaa agtccaaacc tggtagttgc 215640 

aaacacagca agagggtaga ggcagccatt gtcctctggc tgccttggat acagagcttc 215700 

tgggttgggt ggccttgggt cagttttccg aatggttcac ccttggggaa agggaacact 215760 

gctgaagagg tgggaccctg ggagggccgg cctccagctg ggtctctcca gccctcgcct 215820 

tggaacctag gctggaggga gccaaccagg atcctggact tgctacagtt aggtgaacag 215880 

gctcctgcag cctccccttc ccttgggtag ctgtggtggt ggtggtggtg gtggtggtgg 215940 

tggtggtggt ggtggtggtg gtgggggggg gggngnngnt 215980 



<210> 17 
<211> 473 
<212> PRT 
<213> Mus sp. 

<400> 17 

Met Lys Arg Ala Ser Ser Gly Gly Ser Arg Leu Leu Ala Trp Val Leu 
15 10 15 

Trp Leu Gin Ala Trp Arg Val Ala Thr Pro Cys Pro Gly Ala Cys Val 
20 25 30 

Cys Tyr Asn Glu Pro Lys Val Thr Thr Ser Cys Pro Gin Gin Gly Leu 
35 40 45 

Gin Ala Val Pro Thr Gly He Pro Ala Ser Ser Gin Arg He Phe Leu 
50 55 60 

His Gly Asn Arg He Ser His Val Pro Ala Ala Ser Phe Gin Ser Cys 
65 70 75 80 
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Arg Asn Leu Thr lie Leu Trp Leu His Ser Asn Ala Leu Ala Arg lie 
85 90 95 

Asp Ala Ala Ala Phe Thr Gly Leu Thr Leu Leu Glu Gin Leu Asp Leu 
100 105 110 

Ser Asp Asn Ala Gin Leu His Val Val Asp Pro Thr Thr Phe His Gly 
115 120 125 

Leu Gly His Leu His Thr Leu His Leu Asp Arg Cys Gly Leu Arg Glu 
130 135 140 

Leu Gly Pro Gly Leu Phe Arg Gly Leu Ala Ala Leu Gin Tyr Leu Tyr 
145 150 155 160 

Leu Gin Asp Asn Asn Leu Gin Ala Leu Pro Asp Asn Thr Phe Arg Asp 
165 170 175 

Leu Gly Asn Leu Thr His Leu Phe Leu His Gly Asn Arg lie Pro Ser 
180 185 ~ 190 

Val Pro Glu His Ala Phe Arg Gly Leu His Ser Leu Asp Arg Leu Leu 
195 200 205 

Leu His Gin Asn His Val Ala Arg Val His Pro His Ala Phe Arg Asp 
210 215 220 

Leu Gly Arg Leu Met Thr Leu Tyr Leu Phe Ala Asn Asn Leu Ser Met 
225 230 235' 240 

Leu Pro Ala Glu Val Leu Met Pro Leu Arg Ser Leu Gin Tyr Leu Arg 
245 250 255 

Leu Asn Asp Asn Pro Trp Val Cys Asp Cys Arg Ala Arg Pro Leu Trp 
260 265 " 270 

Ala Trp Leu Gin Lys Phe Arg Gly Ser Ser Ser Glu Val Pro Cys Asn 
275 280 285 

Leu Pro Gin Arg Leu Ala Asp Arg Asp Leu Lys Arg Leu Ala Ala Ser 
290 295 300 

Asp Leu Glu Gly Cys Ala Val Ala Ser Gly Pro Phe Arg Pro He Gin 
305 310 315 ~ 320 

Thr Ser Gin Leu Thr Asp Glu Glu Leu Leu Ser Leu Pro Lys Cys Cys 
325 330 335 

Gin Pro Asp Ala Ala Asp Lys Ala Ser Val Leu Glu Pro Gly Arg Pro 
340 345 350 

Ala Ser Ala Gly Asn Ala Leu Lys Gly Arg Val Pro Pro Gly Asp Thr 
355 360 365 

Pro Pro Gly Asn Gly Ser Gly Pro Arg His He Asn Asp Ser Pro Phe 
370 * 375 380 

Gly Thr Leu Pro Ser Ser Ala Glu Pro Pro Leu Thr Ala Leu Arg Pro 
385 390 395 400 

Gly Gly Ser Glu Pro Pro Gly Leu Pro Thr Thr Gly Pro Arg Arg Arg 
405 410 415 



Pro Gly Cys Ser Arg Lys Asn Arg Thr Arg Ser His Cys Arg Leu Gly 
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420 

Gin Ala Gly Ser Gly Ala 
435 

Ala Leu Pro Ala Leu Ala 
450 

Val Leu Trp Thr Val Leu 
465 470 



425 

Ser Gly Thr Gly Asp Ala 
440 

Cys Ser Leu Ala Pro Leu 
455 460 

Gly Pro Cys 



PCT/US01/31488 

430 

Glu Gly Ser Gly 
445 

Gly Leu Ala Leu 



127 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record. 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 
□^BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

U COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□flJNES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



